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The  estimation  of  flow  quantiles  and  the  regionalization  of  hydrologic 
characteristics  are  two  major  foci  of  current  hydrologic  research.  Probabilistic 
representation  of  a  few  individual  quantiles,  notably  the  annual  flood  and  low-flow, 
have  been  devised,  and  recently  a  few  of  the  larger  quantiles  have  been  modelled. 
This  research  describes  and  demonstrates  the  use  of  a  probability  model  for  all  flow 
quantiles  in  a  river. 

A  flow-duration  curve  represents  the  annual  flow-frequency  characteristics  of 
rivers  by  depicting  the  cumulative  frequencies  for  average  ranked  flows  in  a  river. 
Generally  the  process  requires  the  empirical  estimations  of  the  mean  flow  at  each 
of  three  hundred  and  sixty  five  ranks.  A  model  requiring  only  five  parameters  is 
developed  by  combining  the  principles  of  order  statistics  and  traditional  flow- 
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frequency  analyses  and  is  applied  to  flow-duration  curves  for  rivers  in  the  Province 
of  British  Columbia,  Canada. 

Results  from  the  model  may  be  interpreted  both  statistically  and  physically 
and  allow  the  identification  of  hydrologically  similar  regions.  Streamflows  in  British 
Columbia  are  generated  from  a  number  of  distinct  physical  processes  operating  in 
highly  variable  environments.  Two  of  the  primary  generating  processes  active  in 
British  Columbia  are  rainfall  and  spring  snowmelt,  each  of  which  generates  differing 
hydrograph  responses  from  a  given  basin.  These  processes  are  not  exclusive  in  when 
and  where  they  operate  and  combine  with  other  processes,  such  as  summer  glacier 
melt,  to  create  highly  complex  flow  frequency  regimes. 

The  mixing  of  the  streamflow  generating  processes  requires  that  the  statistical 
model  be  capable  of  encompassing  the  different  flow-frequency  regimes. 
Identification  of  hydrologic  regions  also  depends  on  the  inclusion  of  these  generating 
processes.  The  model  presented  incorporates  the  physical  generating  processes  of 
streamflow  in  both  the  statistical  representation  of  flow-duration  curves  and  their 
interpretation.  Similarly,  the  spatial  model  presents  hydrologic  regions  which 
correspond  to  the  known  physical  environment. 

Estimation  of  spatially  continuous,  hydrologic  variables  over  a  region  permits 
the  prediction  of  the  parameters  of  the  model  for  flow-duration  curves  at  ungaged 
locations.  A  model  for  the  estimation  and  generalization  of  flow-duration  curves  for 
rivers  in  British  Columbia,  Canada,  is  developed  and  tested. 


CHAPTER  1 
INTRODUCTION 


Hydrologic  research  has  investigated  several  aspects  of  streamflow  by 
examining  just  a  small  part  of  the  flow-frequency  regime  for  a  river.  Concentration 
on  the  annual  flood  or  annual  low-flow,  which  are  just  segments  of  the  complete 
regime,  is  well  estabhshed.  Current  research  in  hydrology  is  begirming  to  examine 
more  of  the  flow-frequency  regime  by  including  additional  quantiles  (Smith,  1989). 
The  entire  flow-frequency  regime  has  traditionally  been  summarized  by  a  flow- 
duration  curve,  which  describes  the  gySTf  £&  Sr^lf  ^9J^"|^^^^  ^^  ^^^  quantiles  in  a  year. 

in  2010  with  funding  from 
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Flow-Duration  Curves 

A  flow-duration  curve  represents  the  proportion  of  time  that  a  particular 
discharge  in  a  river  is  observed  to  be  equalled  or  exceeded  during  the  period  of 
observation.  Discharge,  which  is  a  continuous  variable,  may  be  averaged  over  hourly, 
daily,  weekly  or  monthly  time  periods.  The  period  of  observation  may  be  daily, 
weekly,  monthly,  annual  or  the  entire  recorded  time  history  of  the  river. 

The  flow-duration  curve  has  the  measure  of  discharge  on  the  y-axis,  and  the 
proportion  of  time  that  discharge  was  equalled  or  exceeded  on  the  x-axis.  It  presents 
a  cumulative  frequency  curve  for  the  magnitude  of  flows  in  any  river. 
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Use  of  Flow-Duration  Curves 

Traditional  engineering  applications  of  flow-duration  curves  are  evaluation  of 
water  supply  potentials  of  a  river  for  reservoir  design  and  hydroelectric  power 
generation  (Chow,  1964;  Foster,  1934;  Searcy,  1959).  Many  water  resources  planning 
and  design  activities  at  some  point  consider  the  flow-frequency  behavior  of  a  river- 
either  the  highest  flows,  the  lowest  flows,  the  intermediate  flows  or  the  average 
characteristics.  Flow-duration  curves  also  provide  a  measure  of  the  magnitude  and 
frequency  of  stream  power,  and  hence  the  erosive  capacity  of  the  stream  and  its 
ability  to  generate  hydroelectric  power  (Foster,  1934;  Leopold  et  al.,  1964;  Richards, 
1982). 

Most  commonly,  operational  hydrology  has  been  concerned  with  the  risks 
associated  with  the  largest  flow  in  a  year  (annual  flood),  and  more  recently,  the 
smallest  flows.  However,  little  interest  has  been  shown  in  the  use  of  flow-duration 
curves  to  evaluate  these  and  other  flow  frequencies  principally  because  of  the  lack 
of  a  theoretical  basis  for  their  study. 

Justification  and  Objectives 

Potter  (1987)  identifies  two  major  areas  of  research  in  operational  hydrology: 
1)  statistical  estimation  of  flood  quantiles,  and  2)  regionalization. 


Ouantile  Estimation 

In  the  last  ten  years  flood  frequency  analysis  has  concentrated  on  the  choice 
of  probability  distributions  to  model  the  annual  flood.  Commonly  used  distributions 
include  the  Log  Pearson  Type  III  (Water  Resources  Council,  1967),  the  generalized 
extreme  value  distribution  (Jenkinson,  1954  &  1969),  the  Wakeby  distribution 
(Houghton,  1978),  the  three  extreme  values  types  (Potter,  1987)  and  the  lognormal 
(Chow,  1954).  The  overriding  criterion  for  success  in  each  study  has  been  the  ability 
to  model  annual  flood  frequencies  using  the  best-fit  distribution  (Cunnane,  1985). 
Where  aimual  flood  frequency  behavior  has  become  too  complex  for  a  single 
distribution  to  model,  additional  processes  have  been  postulated,  and  mixture  models 
developed  (Rossi  et  ah,  1984;  Waylen  &  Woo,  1982).  Despite  the  success  of  these 
models,  there  remains  a  need  for  new  approaches  and  ideas  towards  the  scientific 
understanding  of  flood  frequency,  which  include  both  strong  theoretical  and  physical 
bases  in  addition  to  goodness-of-fit  (KlemeS,  1986;  Potter,  1987). 

This  research  examines  all  daily  flows  generated  by  a  river  within  a  year,  a 
population  from  which  annual  floods  and  low-flows  are  only  special  cases  which  can 
be  incorporated  into  a  more  general  model.  The  aimual  flood  by  definition  is  the 
largest  flow  observed  in  a  year;  similarly  all  the  other  flows  within  a  year  may  be 
ranked  and  the  probabilities  of  the  discharges  at  each  rank  investigated.  The  flows 
in  a  river  are  considered  to  reflect  the  basin  inputs  and  the  processes  active  within 
the  basin.  Their  probabilities  should  therefore  reflect  changing  hydrologic 
environments. 


Regionalization 

The  identification  of  hydrologic  regions  is  intrinsically  geographic  and  has 
practical  application  in  the  estimation  of  characteristics  for  ungaged  basins.  Recent 
research  has  specifically  focussed  on  the  investigation  and  identification  of 
homogenous  regions  of  flow  generating  processes  and  basin  behavior  (Wiltshire, 
1986a).  Techniques  employed  include  complex  regression  models  (Leith,  1975), 
index  flood  approaches  (Dalrymple,  1960;  Natural  Environment  Research  Council, 
1975)  and  mapping  the  parameters  of  probability  distributions  (Matalas  et  al.,  1975). 
The  identification  of  homogenous  regions  requires  the  arbitrary  location  of 
boundaries  on  to  what  are  essentially  continuous  and  scale-dependent  sets  of 
variables.  This  work  acknowledges  the  spatial  heterogeneity  of  hydrologic  processes 
and  attempts  to  model  flow-frequency  indicators  along  a  spatial  continuum. 

Objectives 

This  dissertation  has  three  major  objectives:  1)  to  select  a  flow-duration  curve 
construction  technique  which  corresponds  to  traditional  hydrologic  measures  of  risk, 
and  lends  itself  to  the  subsequent  objectives;  2)  to  build  a  theoretically  based 
quantitative  model  for  the  form  of  flow-duration  curves  which  summarizes  the 
probability  that  a  discharge  will  be  equalled  or  exceeded  for  a  certain  proportion  of 
the  year;  and  3)  to  develop  a  spatial  technique  by  which  parameters  may  be 
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estimated,  and  the  distribution  of  ranked  discharges  derived,  for  any  location  in  the 

study  area. 

Structure  of  Presentation 

Chapter  2  presents  a  review  of  the  literature  and  builds  a  framework  within 
which  to  place  this  dissertation  in  the  broader  perspective  of  hydrologic  research. 
A  description  of  the  study  area  and  data  is  provided  in  chapter  3.  The  theoretical 
model  for  flow-duration  curves  is  developed  in  chapter  4.  Chapter  5  presents  the 
results  of  the  model  in  the  context  of  the  hydrology  of  the  study  area  and  develops 
a  set  of  maps  which  may  be  used  for  spatial  prediction.  Some  additional  implications 
and  directions  of  future  research  are  presented  in  chapter  6.  Chapter  7  presents  a 
summary  of  this  dissertation  and  some  conclusions. 


CHAPTER  2 
LITERATURE  REVIEW 


This  chapter  reviews  the  literature  on  the  flow-duration  curves  and  places 
their  use  within  the  broader  framework  of  flow-frequency  investigation  in  hydrology. 
Additionally,  the  problems  of  spatial  analysis  and  prediction  in  both  geographic  and 
hydrologic  terms  are  discussed. 

Flow-Duration  Curves 

All  discussion  in  this  dissertation  is  restricted  to  the  most  common  observation 
of  daily  discharge.  However,  the  concepts  may  be  applied  directly  to  other  discharge 
measures  such  as  mean  monthly  or  mean  annual  flows.  Daily  discharge  at  a  gage 
represents  the  spatially  integrated  outflow  from  a  basin.  The  streamflow  time  series 
reflects  the  inputs  and  storage  characteristics  of  the  basin  controlled  by  basin  of 
factors  such  as  cHmate,  topography  and  geology  (Searcy,  1959).  The  flow-duration 
curve  provides  a  summary  of  these  factors  and  the  magnitude-frequency  relationship 
for  discharge  in  a  basin  (Wolman  &  Miller,  1960). 

Construction  Techniques 

There  are  four  techniques  used  to  construct  flow-duration  curves.  Each 
approach  provides  curves  which  are  slightly  different  in  shape  and  interpretation. 
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They  may  be  divided  into  two  types:  those  deahng  with  the  entire  record  of  the  river 

and  those  providing  an  annual  characterization  of  the  river.  Both  may  be  calculated 

over  a  water  year,  however  further  discussion  is  restricted  to  the  calendar-year. 

Total-Period  Technique 

All  data  are  sorted  in  descending  order  of  magnitude.  Each  observation  of 
discharge  becomes  the  ordinate,  and  a  plotting  position,  such  as  the  WeibuU 
(Pj  -i/i^"^  1)  where  j  is  the  rank),  is  the  abscissa.  The  result  is  that  each  observation 
is  plotted  as  one  segment  of  the  curve,  (Foster,  1934;  Searcy,  1959;  Watson,  1934). 

Foster  (1934)  describes  this  technique  as  the  most  accurate,  as  it  faithfully 
represents  the  entire  flow-frequency  time  history  of  the  river.  Construction  is  time 
intensive,  and  inclusion  of  additional  data  requires  re-drafting  of  the  entire  curve. 
Important  annual  observations,  such  as  the  annual  flood,  are  lost  because  all  historic 
data  are  aggregated  before  deriving  the  curve.  This  curve  should  be  interpreted  as 
showing  the  proportion  of  time  in  the  entire  record  that  a  particular  discharge  is 
equalled  or  exceeded. 
Complete-Record  Technique 

Observed  data  are  assigned  to  a  set  of  equally-sized  classes.  The  curve  is 
generated  by  graphing  the  class  midpoint  (y-axis)  against  cumulative  relative 
frequency  (x-axis)  and  represents  the  entire  flow-frequency  history  of  the  river,  but 
has  the  concomitant  advantages  and  disadvantages  of  classified  data.  Searcy  (1959) 
describes  this  as  a  more  accurate  form  of  the  curve  when  compared  to  the  calendar- 
year  curve.  New  observations  are  easily  added  to  this  curve,  and  updating  requires 


only  the  redrawing  of  each  class.    Annual  information  is  lost  in  constructing  this 
curve  and  care  must  be  taken  in  extracting  values  from  the  classified  data. 
Average  Annual  Hydrograph  Technique 

The  average  annual  hydrograph  is  constructed  by  calculating  mean  discharges 
by  day,  over  the  years  of  record.  The  resulting  average  annual  hydrograph  is  then 
sorted  in  descending  order  and  average  daily  discharge  plotted  against  rank.  The 
curve  thereby  retains  annual  information  about  the  average  discharge  at  each  day. 

Because  averaging  is  conducted  by  day,  and  subsequently  sorted,  daily 
variances,  which  tend  to  be  high,  are  unrepresented  in  the  finished  product.  The 
highest  flow  in  a  river  does  not  occur  on  the  same  day  of  every  year,  nor  is  every 
year  similar.  The  result  is  to  mix  the  individual  ranked  flows  and  to  lose  detail  about 
annual  variability.  Updating  the  curve  is  relatively  easy,  as  only  365  points  have  to 
be  recalculated  per  year.  The  primary  benefit  of  this  technique  is  that  years  of  only 
partial  discharge  record  may  be  included.  Interpreting  this  curve  is  very  difficult  as 
the  information  it  presents  is  the  ranked  characterization  of  the  seasonality  function 
of  the  river. 
Calendar- Year  Technique 

Observations  of  discharge  for  each  year  are  ranked  and  averages  by  rank 
calculated.  The  plot  of  average  by  rank  is  the  annual  flow-duration  curve.  This 
curve  truly  represents  the  average  ranked  flow  behavior  in  a  river.  Traditional  flow- 
frequency  measures  such  as  the  mean  annual  flood,  are  preserved.    Searcy  (1959) 
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describes  the  curve  as  less  accurate  than  the  complete-record,  but  acknowledges  that 

it  is  the  only  true  representation  of  the  year-to-year  behavior. 
Discussion 

Each  construction  technique  has  particular  merits  and  produces  very  different 
results.  Both  the  total-period  and  the  complete-record  approaches  provide 
information  about  the  entire  recorded  time  history  of  the  river.  The  average  annual 
hydrograph  and  calendar-year  curves  provide  annual  information. 

Figures  1  (A  &  B)  and  2  (A  &  B)  compare  the  curves  resulting  from  the  four 
construction  techniques  for  the  Sumas  River  in  British  Columbia.  The  total-period 
and  complete-record  curves  (Figure  1)  are  very  similar.  The  total-period  curve  tends 
to  give  shghtly  higher  discharges  for  all  percentages  than  does  the  complete-record. 
This  is  especially  noticeable  at  both  the  lowest  and  highest  percentage  regions  of  the 
curve.  The  greatest  handicap  of  the  complete-record  technique  is  the  use  of  classes. 
Any  year  of  record  may  not  contribute  to  all  classes  evenly,  class  sizes  will  vary 
greatly  and  the  resulting  curve  may  be  very  different  for  different  numbers  of  classes. 

In  Figure  2  both  axes  are  identical  and  the  graphs  may  be  accurately 
compared,  yet  the  average  annual  hydrograph  and  calendar-year  curves  are  very 
different.  The  average  annual  hydrograph  technique  gives  lower  discharge  values  at 
the  low-rank  end  of  the  curve  than  does  the  calendar-year  curve,  while  the  middle 
regions  of  both  curves  are  very  similar.  For  low  ranks,  the  average  annual 
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hydrograph  curve  provides  estimates  of  discharge  which  are  one  third  of  those 

provided  by  the  calendar-year  curve. 

Comparing  the  curves  across  techniques,  the  complete-record  curve  (Figure 
1-B),  recommended  by  the  U.S.G.S.  (Searcy,  1959),  tends  to  estimate  higher 
discharges  for  rare  events  than  does  the  calendar-year  curve.  For  most  of  the  values 
on  the  X-axis,  the  curves  are  identical,  diverging  again  at  the  low  discharge  end  of  the 
curve  where  the  complete-record  technique  gives  values  of  discharge  less  than  the 
calendar-year. 

Flow-duration  curves  are  generally  used  to  make  qualitative  comparisons 
between  different  flow  regimes  and  to  infer  basin  characteristics.  Two  of  the  four 
curves  have  no  theoretical  parallel  in  statistical  theory.  There  is  however  a  similarity 
between  the  construction  of  both  the  calendar-year  and  total-period  curves  and  the 
field  of  order  statistics. 

Relation  to  Traditional  Frequency  Measures 

One  of  the  most  significant  factors  in  the  emergence  of  hydrology  as  a 
separate  and  distinct  science  is  the  full-fledged  use  of  probabilistic  and  random 
process  techniques  in  the  study  of  hydrologic  problems  (Bras  &  Rodriguez-Iturbe, 
1985).  The  probabilistic  approach  has  allowed  hydrologists  to  model  phenomena 
which  would  have  been  impossible  or  too  expensive  to  model  deterministically, 
phenomena  as  diverse  as  hydraulic  conductivity  (Freeze,  1975)  and  paleoflood  events 
(Baker,  1985).    The  ability  of  stochastic  models  to  represent  what  is  known  and 
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important  about  the  phenomenon  and  to  represent  the  unknown  or  unobserved 

portions  by  probability  distributions,  has  freed  hydrologists  from  over-abstraction  or 

'mathematistry'  (Box,  1976;  Klemeg,  1986).   This  permits  the  marrying  of  a  strong 

theoretical  basis  to  a  sound  physical  basis  in  the  models  (Potter,  1987). 

Current  hydrologic  frequency  analysis  emphasizes  a  better  understanding  of 
the  physical  processes  affecting  the  flood  frequency  characteristics  in  order  to 
construct  improved  risk  prediction  models  (Kleme§,  1986;  Potter,  1987).  Knowledge 
of  hydrologic  generating  processes  has  led  to  increased  emphasis  on  the  role  of  the 
drainage  basin,  its  inputs  via  the  hydrologic  cycle  and  their  interaction  with  the  basin 
(Dooge,  1986).  The  criteria  for  an  improved  hydrologic  model  are  now  not  only 
goodness-of-fit  but  also  physical  reasonableness  and  relative  parsimony  (Yevjevich 
&  Obeysekera,  1985).  Such  models  show  an  increasing  analytical  sophistication 
(Potter,  1987),  and  may  include  mixed  processes  (Rossi  et  al.,  1984;  Waylen,  1985a), 
time  series  approaches  (Smith  &  Karr,  1986)  and  the  analysis  of  regionalized 
variables  (Acreman,  1985). 

Analysis  of  annual  floods  is  perhaps  the  most  intensely,  widely  and  longest 
studied  probabilistic  problem  in  hydrology.  This  concentration  is  largely  historical 
and  is  continued  out  of  convenience  and  the  fact  that  there  are  few  better  measures 
of  extraordinary  flooding  than  the  annual  flood.  Approaches  to  the  problem  have 
ranged  from  strictly  empirical  modelling  (Hazen  1930;  Jarvis,  1936),  through  the 
application  of  extreme  value  theory  (Fisher  &  Tippett,  1928;  Gumbel  1941  &  1958), 
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and  lately  to  the  selection  of  best-fit  distributions  (Lamberti  &  Pilati,  1985)  and  exact 

analytical  solutions  from  other  forms  of  analysis  (Zelenhasic,  1970). 

Annual  flood  studies  can  be  divided  into  two  broad  approaches,  one  stressing 
extreme  value  theory  as  a  justification  of  the  choice  of  a  particular  distribution,  the 
other  abandoning  theoretical  bases  and  stressing  goodness-of-fit.  Extreme  value 
distributions  have  been  rejected  on  the  grounds  that  the  physical  generating  processes 
of  streamflow  do  not  meet  the  mathematical  assumptions  of  the  theory  (Gupta  et  al., 
1976).  However,  some  argument  for  the  use  of  extreme  value  theory  is  still  made, 
on  the  grounds  that  the  annual  flood  is  the  extreme  value  of  discharge  observed  in 
a  year  (Cunnane,  1985). 

Extreme  value  theory  yields  the  three  extreme  value  (EV)  distributions,  the 
EV  1  or  Gumbel  distribution,  the  EV  2  or  Frechet  distribution,  and  the  EV  3  or 
Weibull  distribution.  Jenkinson's  (1954  &  1969)  proposal  that  a  generalized  extreme 
value  distribution  (GEV)  be  used,  has  been  widely  adopted  (Hosking  &  WaUis, 
1986).  The  extreme  value  approach  has  been  successful  (Waylen  &  Woo,  1983; 
Wiltshire,  1986b),  despite  its  apparent  lack  of  theoretical  support  and  the  problems 
of  small  sample  sizes,  independence  of  floods  and  non-stationarity  in  the  distributions 
of  their  magnitudes  (Fiorentino  et  al.,  1985). 

Conversely,  the  disregard  for  any  theoretical  basis  in  the  selection  of  a 
probability  distribution  to  model  annual  floods  has  resulted  in  a  wide  variety  of 
distributions,  each  with  its  own  set  of  advocates.  Distributions  chosen  have  included 
the  EV  I  or  Gumbel  (Waylen  &  Woo,  1982  &  1983),  EV  U  and  EV  in  extreme 
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value  distributions,  a  generalized  extreme  value  distribution  (Wiltshire,  1986b), 

mixtures  of  the  extreme  value  type  I  (Waylen  &  Caviedes,  1986;  Waylen  &  Woo, 

1982),  the  Pearson  type  III  (Matalas  &  Wallis,  1973)  and  log  Pearson  type  HI  (Water 

Resources  Council,  1967),  the  lognormal  (Chow,  1954),  the  gamma  distributions  and 

mixtures  of  the  analytical  form  from  the  partial  duration  series  (Rossi  et  al.,  1984). 

These  distributional  choices  have  no  theoretical  justification,  but  they  have  the  ability 

to  fit  the  data  at  a  station  and  permit  spatial  comparison.   Regional  flood  models 

have  also  been  proposed  that  require  the  assumption  of  dependence  structures 

between  sites  and  the  parameters  of  flood  distributions  (National  Research  Council, 

1988). 

The  analysis  of  annual  low-flows  has  always  been  the  less  studied  of  the 

annual  measures.    Numerical  advances  parallel  those  of  annual  flood  research. 

Gumbel  (1954a)  applied  extreme  value  theory  to  the  study  of  droughts,  as  reflected 

by  the  lowest  discharge  in  a  year.   He,  and  later,  Matalas  (1963),  used  the  EV  HI 

extreme  value  distribution,  which  is  bounded  in  the  direction  of  interest.  Waylen  and 

Woo  (1987)  have  successfully  used  mixtures  of  extreme  value  distributions  to 

represent  the  differing  low-flow  generating  processes  in  snowmelt  rivers.  Often  the 

aimual  low-flow  is  used  as  a  surrogate  for  measuring  hydrologic  drought.  Dracup  et 

al.  (1980b)  differentiate  between  low-flows  and  droughts  on  the  basis  of  streamflow 

and  the  period  over  which  the  low-flow  is  estimated.    Chang  and  Boyer  (1977) 

estimate  the  annual  7-day  average  low-flow  with  a  10  year  return  period  from 

watershed  and  climatic  parameters.  Dracup  et  al.  (1980a  &  b)  and  Zelenhasic  and 


16 

Salvi  (1987)  used  the  partial  duration  series  to  define  hydrologic  droughts  on  the 

basis  of  streamflow  below  some  level. 

Spatial  Estimation  in  Hydrology 

When  hydrologic  records  are  available  they  are  often  short  and  may  contain 
sampling  errors.  Simulation  or  extension  of  these  records  simply  preserves  and  may 
amplify  these  errors.  Frequently  records  are  simply  not  available  for  the  river  of 
interest  but  are  available  for  nearby  rivers.  It  is  therefore  often  desirable  to  spatially 
aggregate  available  records,  thereby  enhancing  the  regional  record  and  increasing  the 
effective  sample  size. 

Within  the  greater  issue  of  the  spatial  problem  in  hydrologic  estimation,  three 
separate  questions  may  be  identified:  1)  At  what  scale  should  an  analysis  be 
conducted?  2)  Should  the  data  be  regionalized,  and  how  should  the  regions  be 
defined?  3)  What  technique  should  be  used  to  make  predictions? 

Basin  Scale 

Analysis  of  hydrologic  data,  in  particular  streamflow,  is  frequently  undertaken 
at  the  basin  scale.  Streamflow  is  an  integrated  measure  of  all  the  processes  and 
influences  active  within  that  basin.  Increased  basin  area  generally  produces  greater 
inputs  and  increased  basin  storage.  Given  the  range  of  possible  basin  sizes,  Kleme§ 
(1983)  warns  that  hydrologists  should  be  sensitive  to  the  differences  in  behavior 
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resulting  from  basin  area  and  to  avoid  direct  comparisons  between  the  large  and 

small. 

The  effects  of  the  drainage  basin  are  most  pronounced  in  the  frequency 
distribution  of  daily  discharges.  Greater  basin  area  promotes  a  larger  mean  daily 
discharge  and  lower  skewness.  Daily  variance  on  the  other  hand  appears  to  be 
largely  a  function  of  seasonaHty. 

In  terms  of  physical  effects,  the  presence  of  large  amounts  of  lake  storage 
produces  a  more  even  distribution  of  flows  and  moderates  the  range  of  flows.  Snow 
and  ice  storage  produces  rapid  releases  of  water  during  the  melt  season,  and 
consequently  generate  the  highest  sustained  flows.  Ice  storage  in  the  form  of  glaciers 
releases  water  more  consistently  during  the  warm  season.  The  role  soils,  geology  and 
slope  of  the  basin  as  controls  on  ground  water  contributions  are  noticeable  in  the 
low-flow  season. 

The  basic  technique  for  'standardizing'  flows  from  a  variety  of  drainage  basins, 
of  varying  sizes,  is  to  divide  discharge  by  drainage  area  (Smith,  1989),  yielding 
discharge  in  units  of  volume  per  unit  time  per  unit  area  (specific  runoff).  This 
technique  allows  a  crude  comparison  between  large  and  small  basins.  Basin  area  is 
chosen  because  it  is  generally  easy  to  measure  and  is  strongly  related  to  other  very 
important  characteristics  such  as  relief,  slope,  drainage  density  and  basin  shape 
(Gregory  &  Walling,  1973).  All  else  constant,  the  magnitude  of  standardized  basin 
runoff  is  inversely  proportional  to  catchment  area. 
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Flows  may  also  be  standardized  by  'indexing'  them  as  ratios  of  mean  annual 

flow  or  mean  annual  flood  (Acreman,  1985;  Natural  Environment  Research  Council, 

1975).  Acreman  (1985)  further  suggests  that  the  denominators  in  the  indices  may  be 

generated  from  basin  parameters  and  not  from  the  record  itself.  A  third  technique, 

advocated  by  Fiorentino  et  al.  (1985)  and  Rossi  et  al.  (1984),  standardizes  flows  by 

use  of  estimated  parameters,  similar  to  a  z-score.     Similar  to  the  index  flood 

approaches,  basin-scale  data  may  be  expressed  as  standardized  values  at  region  scale. 

This  technique  involves  aggregation  of  similar  basin's  data  to  get  larger  sample  sizes 

and  more  stable  and  less-biased  parameter  estimates,  which  lend  themselves  to  the 

analysis  of  annual  flood  data  (Dalrymple,  1960;  Natural  Environment  Research 

Council,  1975). 

Regionalization.  Prediction  and  Estimation 

There  are  two  major  approaches  to  the  analysis  of  spatial  variation  in 
hydrology:  1)  identification  of  homogenous  drainage  basins  and  2)  modelling  the 
variability  of  hydrologic  models  continuously  over  space.  In  the  first  approach,  basins 
possessing  similar  statistical  properties  are  grouped  and  are  examined  for  similarity 
and  in  the  second,  the  parameters  estimated  in  each  basin  are  mapped  and  their 
spatial  pattern  expressed  mathematically. 

Homogeneity  statistics  are  based  on  physical  and/or  flow  characteristics  of  the 
basin  (Wiltshire,  1986a  &  b).  Soil  type,  average  slope,  lake  storage,  snow  or  ice 
storage,  mean  annual  precipitation,  mean  aimual  temperature  and  basin  area  have 
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all  been  used  as  physical  indicators  (Gottschalk,  1985;  Matalas,  1967;  Wiltshire, 

1985).  Wiltshire  (1985)  employed  climatological  variables  including  average  annual 
rainfall  in  combination  with  some  basin  morphology  measures  as  the  basis  for 
classification.  Flow  characteristics,  index  floods,  distribution  type,  parameter  values 
of  a  regional  parent  distribution,  estimated  discharge  for  a  particular  quantile  (or 
return  period)  or  general  statistical  measures  may  be  put  to  the  same  purpose 
(Matalas  et  al.,  1975;  Natural  Environment  Research  Council,  1975;  Wiltshire, 
1986a). 

Inherent  in  the  idea  of  geographic  regionalization  is  classification,  the 
technique  by  which  a  site  observation  is  determined  to  belong  to  a  particular  region. 
The  classification  of  regions  is  performed  on  the  basis  of  two  criteria:  1)  minimizing 
the  variability  within  each  region  or  class  and  2)  maximizing  the  differences  between 
classes  (Smith,  1975). 

Hydrology  has  stressed  the  aspect  of  homogeneity,  or  minimizing  the  within- 
class  variability,  as  the  basis  for  determining  regions.  A  variety  of  statistics  have 
been  devised  (Lettenmaier  &  Potter,  1985;  Lettenmaier  et  al.,  1987;  Wiltshire,  1986a 
&  b),  to  determine  regional  homogeneity,  the  simplest  of  which  is  the  regional  mean 
coefficient  of  variation.  The  presence  of  a  common  underlying  flood  frequency 
distribution  which  varies  in  only  a  few  parameters  has  been  a  basic  assumption  of 
hydrologic  regionalization.  Regional  models  have  been  shown  to  be  resistant  to  mis- 
specification  of  the  underlying  distribution  (Lettenmaier  et  al.,  1987),  regional 
heterogeneity  (Lettenmaier  et  al.,  1987)  and  intersite  dependence  (Hosking  &  Wallis, 
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1988).  However,  Lettenmaier  et  al.  (1987,  page  313)  identify  "the  lack  of  a  physical 

basis  for  determining  the  form  of  the  underlying  flood  frequency  distribution"  as  a 

major  complication  in  the  use  of  regionalization  techniques.  Assignment  of  basins 

to  a  region  is  completed  through  statistical  techniques  including  cluster  analysis 

(Mosley,  1981),  discriminant  analysis  (Waylen  &  Woo,  1984),  analysis  of  variance 

(Wiltshire  1985  &  1986a),  principal  components  and  pairwise  grouping  (Gottschalk, 

1985). 

Continuous  mapping  techniques  such  as  trend  surfaces  (Unwin,  1975;  Woo  & 
Waylen,  1984),  multiple  regression  (Leith,  1976  &  1978)  and  kriging  (Gambolati  & 
Volpi,  1979;  Olea,  1974;  Woolhiser  &  Roldan,  1986)  have  been  used  to  describe  the 
variability  of  the  parameters  of  a  regional  parent  flood  distribution  (Matalas  et  al., 
1975),  or  mixed  flood  frequency  distributions  (Woo  &  Waylen,  1984).  Prediction  of 
characteristics  of  ungaged  locations  requires  estimated  surface  coefficients,  basin 
location  and  the  necessary  basin  variables.  The  accuracy  of  the  results  is  determined 
by  the  reliability  of  the  estimated  coefficients  and  of  the  measured  basin  variables. 
The  technique  of  kriging  is  largely  a  mapping  tool  and  provides  much  better  contour 
results  than  most  conventional  mapping  techniques  (Olea,  1974).  The  mapped 
results  from  kriging  do  not  provide  the  kind  of  false  accuracy  of  the  other  techniques 
but,  at  the  same  time,  kriging  does  not  allow  mathematical  prediction,  only 
interpolation,  from  a  map. 

Geographic  regionalization  aims  for  the  identification  of  regions  of  underlying 
common  behavior  and  characteristics  for  the  goal  of  spatial  generalization  with  the 
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added  requirement  of  spatial  continuity  (Amedeo  &  Golledge,  1975;  Smith,  1975). 

Hydrologic  regionalization  aims  for  the  identification  of  regions  of  homogeneity  in 

the  desired  characteristics  for  the  goal  of  substituting  space  for  time  and  lengthening 

the  effective  record  of  hydrologic  observations;  this  does  not  require  geographic 

proximity  (Lettenmaier  et  ah,  1987;  Wiltshire  1985). 


CHAPTERS 
STUDY  AREA  AND  DATA 


This  chapter  presents  a  brief  overview  of  the  dimate  and  physiography  of 
British  Columbia,  Canada.  The  description  is  aided  by  a  series  of  maps  and  graphs 
to  illustrate  the  wide  variety  of  climatic  and  streamflow  regimes  observed  over  the 
province.  A  description  of  the  data  and  their  source  is  provided  in  the  last  section 
of  this  chapter. 

Study  Area 

Rationale 

The  province  of  British  Columbia,  Canada  (Figure  3)  is  chosen  as  the  study 
area  because  of  the  following  significant  advantages: 

1)  A  large  number  of  excellent  quality  discharge  records  exist  for  the  area. 
Many  records  begin  in  the  1920s,  and  more  than  100  possess  at  least  fifteen 
years  of  complete  record  (see  Appendix  A). 

2)  The  density  of  gaging  stations  exceed  the  minimum  standard  set  by  the 
World  Meteorological  Organization  (W.M.O.)  for  mountainous  terrain  of  one 
station  for  each  5,000  km^.  British  Columbia  has  one  station  for  each  4700 
km^  (Kreuder,  1979;  World  Meteorological  Organization,  1981). 
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Figure  3.  Physiographic  Regions 
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Gaging  station 


Figure  4.  Hydrometric  Network 


25 

3)  The  network  (Figure  4)  of  gaging  stations  has  been  designed  to  cover  as 

many  of  the  major  rivers  as  possible  (Kreuder,  1979)  and  to  provide  a  number 
of  transect  lines  across  the  province. 

4)  There  are  many  large  and  small  rivers  whose  flows  are  not  significantly 
modified  by  human  activity  in  a  variety  of  physiographic  and  hydrologic 
environments. 

On  this  basis,  one  hundred  and  eighty  two  gaging  stations  are  included  for 
analysis  in  this  research.  Figure  4  shows  the  hydrometric  network  of  these  gaging 
stations,  and  Appendix  A  lists  each  gage,  it's  full  name  and  location.  To  be  included 
in  the  analysis,  the  gage  must  be  located  on  a  basin  without  substantial  modification 
such  as  diversions  or  dams  and  possess  nineteen  or  more  years  of  complete  record. 
In  areas  of  poor  spatial  coverage,  gages  with  more  than  sixteen  years  of  record  are 
used. 

General 

The  province  (Figure  3)  occupies  an  area  of  nearly  1  million  square 
kilometers  bounded  to  the  south  by  the  49 'th  parallel  and  to  the  north  by  the  60 'th. 
It's  northeastern  boundary  with  the  province  of  Alberta  is  marked  by  the  meridian, 
120°W,  and  follows  the  continental  divide  south  and  east  from  a  latitude  of 
approximately  53°30'N,  meeting  the  U.S./Canada  border  at  114°W.  To  the 
southwest,  the  province  is  bounded  by  the  Pacific  Ocean  and  to  the  northwest  by  the 
panhandle  of  Alaska.  Physiographically  most  of  the  province  encompasses  a  series 
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of  diverse  regions  collectively  termed  the  Western  Cordillera,  with  the  exception  of 

the  extreme  northeast,  which  occupies  the  northern  interior  plains  (Department  of 

Energy,  Mines  and  Resources,  Canada,  1973).    The  following  brief  review  of  the 

physical  geography  of  the  area  will  restrict  itself  to  those  factors  directly  related  to 

streamflow. 

Physiography 

The  Western  Cordillera  is  composed  of  three  principle  regions,  or  systems, 
which  run  sub-parallel  northwest-southeast.  The  geologic  history  of  the  region  is 
documented  by  Monger  and  Price  (1979).  The  western  system  is  fringed  by  areas  of 
coastal  plains,  the  Fraser,  Georgia  and  Hecate  Lowlands,  where  elevations  seldom 
exceed  50m.  Immediately  abutting  these  to  the  east,  and  frequently  intersecting  the 
fiorded  coastline,  lies  a  high  mountainous  region  of  the  Coast  and  Cascade 
Mountains,  which  attain  over  3000m  of  elevation. 

The  interior  system  is  a  complex  of  mountains  and  plateaus.  The  Stikine  and 
Nechako  Plateaus  lie  north  and  south,  respectively,  of  the  Skeena  and  Omineca 
Mountains.  They  comprise  subdued  relief  with  a  regional  surface  at  about  750m. 
Occasionally  the  relief  is  interrupted  by  Cenozoic  Volcanoes  which  rise  above  the 
surface  to  about  2000m  and  by  major  river  valleys,  which  may  be  as  much  as  600m 
below  the  plateau  surface.  Elevations  in  the  intervening  mountains  are  generally 
between  2000  and  2500m.  The  eastern  portions  of  the  interior  system  rise  from  the 
plateaus  to  the  Columbia  Mountains  (in  the  south)  and  the  Cassiar  Mountains  (in 
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the  north),  which  ultimately  attain  elevations  comparable  to  the  Coast  and  Rocky 

Mountains. 

The  eastern  system  of  the  northern  and  southern  Rocky  Mountains  is 
separated  from  the  interior  system  by  an  ancient  subduction  zone,  currently  forming 
the  low  lying  Rocky  Mountain  Trench.  Elevations  in  this  system  are  some  of  the 
highest  found  in  Canada,  frequently  supporting  permanent  glacial  cover.  The  Trench 
is  presently  occupied  by  the  Columbia,  Fraser,  Parsnip  and  Finlay  Rivers  from  the 
U.S.  to  the  Yukon  border. 

The  Peace,  Hay  and  Liard  Rivers  of  the  Arctic  drainage  flow  northeastwards 
towards  the  MacKenzie  River  through  the  Rocky  Mountain  Foothills  and  across  the 
Alberta  Plateau  to  the  Fort  Nelson  and  Peace  River  Lowlands,  where  elevations 
range  between  300  and  600,  in  a  gently  undulating  landscape. 

Precipitation 

The  following  discussion  of  climatological  and  hydrological  variables  are  based 
on  statistics  compiled  by  Environment  Canada  (1982  &  1988). 

The  two  parallel  systems  of  mountains  separated  by  the  interior  plateau  he 
across  the  dominant  westerly  winds  at  these  latitudes  and  exert  considerable 
influence  on  the  pattern  of  mean  annual  precipitation  (Figure  5).  On  the  windward 
side  of  the  Coast  Mountains  mean  annual  precipitation  may  exceed  3000mm  (Farley, 
1979)  with  a  marked  winter  seasonal  maxima  in  an  Alpine  Maritime  Climate  (Fraser 
River  Board,  1958).  Precipitation  decHnes  rapidly  on  the  Interior  Plateau  to  as  little 
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Figure  5.  Mean  Annual  Precipitation  (Contour  Interval  =  600nim) 
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as  200mm  in  the  southern  Dry  Continental  CHmate  or  400mm  in  the  northern  Humid 

Continental  Climate.  Precipitation  is  generally  uniformly  distributed  throughout  the 

year  with  the  possibility  of  a  slight  summer  maximum  resulting  from  convective 

activity. 

Increased  elevations  in  the  Columbia,  Cassiar  and  Rocky  Mountains  produce 
annual  precipitation  in  the  order  of  1500mm  with  a  winter  maximum.  This  Alpine 
Humid  Continental  Climate  is  highly  variable  locally;  the  Rocky  Mountain  Trench 
lies  in  a  very  pronounced  region  of  rainshadow.  To  the  north  of  the  Skeena  and 
Omineca  Mountains  the  altitude  of  the  Rocky  Mountains  diminishes  and  the  pattern 
of  rainfall  becomes  increasingly  controlled  by  latitude.  All  but  the  most  westerly 
extremes  of  the  northern  part  of  the  province  experience  300  and  SOOmm  of  annual 
precipitation  and  a  summer  maximum  in  a  climate  described  as  Alpine  Subarctic. 

Figure  6  locates  twelve  sample  climate  stations  which  demonstrate  the  mean 
monthly  precipitation  regimes  of  the  Province  (Figures  7  and  8).  Qualicum  River 
and  Chilliwack  are  located  along  the  Coast  Mountains  and  experience  a  winter 
rainfall  maximum,  which  progresses  towards  a  fall  maximum,  at  Kitimat.  Bralome, 
Telegraph  Creek  and  Fort  Babine  are  located  in  the  rainshadow  of  the  Coast 
Mountains;  they  show  an  even  distribution  of  precipitation  through  the  year,  with  a 
slight  increase  of  winter  precipitation  in  the  south  at  Bralome.  Smith  River,  Pink 
Mountain,  Puntchesakut  Lake  and  Hemp  Creek  have  distinct  summer  maxima  in 
precipitation.  Vernon  and  Cranbrook  show  a  mixing  of  both  winter  and  summer 
maxima  leading  to  an  almost  uniform  distribution  of  precipitation  through  the  year. 
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Figure  6.  Sample  Climate  Stations 
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Figure  7.  Northern  Climate  Stations  Mean  Monthly  Precipitation 
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Figure  8.  Southern  Climate  Stations  Mean  Monthly  Precipitation 
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Temperature 

Monthly  temperatures  (Figures  9  and  10)  are  primarily  a  function  of  elevation, 
latitude  and  distance  from  the  Pacific  Ocean.  Continentality  increases  rapidly  inland 
as  the  Coast  Mountains  effectively  exclude  the  moderating  effect  of  the  Pacific. 
Application  of  a  simple  adiabatic  lapse  rate  to  the  3000m  of  available  relief  gives 
some  idea  of  the  influence  of  elevation.  Mean  annual  temperatures  range  from 
above  10°C  on  the  Pacific  southwest  to  well  below  zero  in  the  Northern  Interior. 
The  most  significant  effect  that  temperature  has  on  streamflow  is  in  determining  the 
timing  and  quantities  of  water  released  from  snow  and  ice  storage  (Pipes  et  al., 
1970).  On  the  Fraser  Lowland  snow  makes  up  less  than  two  percent  of  total  annual 
precipitation  while  it  may  constitute  greater  than  70  percent  in  the  northeast 
mountains. 

In  general,  the  lowest  monthly  temperature  occurs  in  January  and  the  highest 
in  July  and  August.  The  moderating  influence  of  the  Pacific  Ocean  is  shown  in  the 
thermographs  of  Chilliwack  and  Qualicum  River  where  average  temperatures  remain 
above  0°C  through  the  year.  Both  eastward  and  northward  the  thermal  regime  is 
marked  by  increasing  annual  range  and  longer,  colder  winters.  The  influence  of 
elevation  is  not  readily  demonstrated  in  these  thermographs,  as  most  climate  stations 
in  British  Columbia  are  found  in  the  valley  bottoms  with  elevations  less  than  1,000m. 
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Figure  9.  Northern  Climate  Stations  Monthly  Temperatures 
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Figure  10.  Southern  Climate  Stations  Monthly  Temperatures 
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Streamflow 

All  streamflow  regimes  in  the  study  area  are  highly  seasonal.  With  the 
exception  of  Vancouver  Island  and  the  west  side  of  the  Coast  Mountains,  most  are 
dominated  by  snowmelt  and  have  pronounced  winter  minima.  At  higher  elevations 
and  latitudes  glacial  melt  becomes  an  important  factor  in  maintaining  summer  high 
flows.  Along  the  coast  the  intrusion  of  warm  moist  Pacific  air  and  accompanying 
frontal  systems  may  produce  heavy  winter  rains,  though  their  influence  diminishes 
inland. 

Figure  11  shows  twelve  gaging  stations  on  rivers  located  close  to  the  previous 
twelve  climate  stations.  The  different  monthly  flow  regimes  across  the  province  are 
depicted  in  Figures  12  and  13.  Little  Qualicum  River  on  Vancouver  Island 
experiences  a  winter  maximum.  This  basic  pattern  is  modified  in  the  west-facing 
slopes  of  the  Coast  and  Cascade  Mountains  to  include  spring  snowmelt  and  summer 
glacier  melt  at  the  higher  elevations  (Slesse  Creek  and  Kitimat  River).  The  interior 
of  the  province  is  dominated  by  snowmelt  regimes  (Coldstream  and  Kootenay  Rivers) 
of  much  smaller  magnitude  than  the  coastal  basins  and  has  no  winter  component. 
The  magnitude  of  the  summer  discharge  peak  largely  reflects  the  amount  of  snowfall 
and  snow  storage  in  each  basin.  Gages  are  located  in  the  valleys  but  the  gaged  flows 
represent  the  runoff  generated  from  relatively  large  areas  frequently  well  above  the 
gage  itself. 
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Figure  12.  Northern  Gaging  Stations  Mean  Monthly  Discharge 
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Figure  13.  Southern  Gaging  Stations  Mean  Monthly  Discharge 
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Data 

The  necessary  daily  discharge  data  for  complete  calendar  years  of  record  were 
obtained  from  the  Water  Survey  of  Canada  (W.S.C.)  which  is  responsible  for  the 
archiving  and  the  conducting  of  a  variety  of  quality  control  tests  on  the  data,  to 
ensure  its  accuracy.  Data  were  obtained  directly  in  computer  compatible  format  on 
magnetic  tape  in  cubic  meters  per  second.  (Environment  Canada,  1980). 

The  requirements  of  the  data  and  study  area  included  1)  sampling  of  as  wide 
a  variety  of  environments  and  flow  regimes  as  possible,  2)  long  discharge  records,  3) 
sampling  of  a  wide  range  of  basin  sizes,  and  4)  a  relatively  high  station  density,  with 
stations  sampling  similarly-sized  contiguous  basins. 


CHAPTER  4 
STATISTICAL  MODEL 


Approach  and  Theory 

In  the  calendar-year  flow-duration  curve  both  the  mean  discharge  of  each 
annual  rank  and  an  associated  variability  about  that  mean  are  retained.  The 
empirical  construction  of  this  curve  has  a  strong  theoretical  parallel  to  order 
statistics,  which  may  be  used  to  estimate  the  mean  discharge  at  any  rank  once  the 
nature  of  the  parent  distribution  of  all  daily  streamflows  on  the  river  is  known. 

This  chapter  presents  the  development  of  a  statistical  model  for  the 
construction  and  use  of  flow-duration  curves.  An  outline  of  order  statistics  is 
provided,  the  theoretical  model  is  then  developed  and  the  concept  and  possible  forms 
of  a  parent  distribution  for  the  generation  of  streamflows  discussed.  Finally  a  review 
of  the  techniques  employed  in  spatially  modelling  is  presented. 

Order  Statistics 

An  order  statistic  is  a  ranked  observation  from  a  sample  (Galambos,  1984). 
This  is  similar  to  the  process  by  which  daily  discharges  are  ranked  to  construct  a 
calendar-year  flow-duration  curve,  where  the  sample  is  one  year's  observations.  The 
ranking  is  assumed  to  be  in  descending  order  of  magnitude  throughout  the  research 
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but  could  equally  be  assumed  to  be  ascending.    There  are  two  forms  of  order 

statistics:  absolute  and  relative.  In  the  first,  the  ranks  are  calculated  while  the  size 

of  each  sample  changes;  for  instance,  the  distribution  of  the  second  largest 

observation  may  be  studied,  regardless  of  sample  size  (so  long  as  it  is  larger  than 

one).    In  the  second,  the  rank  is  expressed  as  a  ratio  of  the  sample  size  and  the 

distribution  of  the  variable  at  a  fixed  value  of  that  ratio  is  considered  (for  example, 

0.5  for  the  median).  The  ratio  occupying  that  rank  is  a  relative  order  statistic.  When 

the  sample  size  is  fixed,  as  in  the  case  of  daily  streamflow  observations  per  year, 

excluding  February  29  on  leap  years,  these  two  approaches  are  interchangeable. 

Let  Xj,  X2,  .  .  .  ,  X„  denote  a  random  sample  from  a  population  with 

continuous  cumulative  distribution  function  Fx-     Within  that  sample,  a  unique 

ordered  arrangement  should  exist:  suppose  that  X^j^  denotes  the  largest  of  that  set; 

X(2)  denotes  the  second  largest,  and  so  on;  and  X^^^  denotes  the  smallest.  Then 

^1)   >   ^(2)   >    •  •  •   >   X(„) 

denotes  the  original  random  sample  after  arrangement  in  decreasing  order  of 
magnitude,  and  are  collectively  termed  the  order  statistics  of  the  random  sample. 
X(^),  for  1  <  r  <  n  is  called  the  r'*"  order  statistic  (Gibbons,  1985). 

The  annual  flood,  which  is  the  maximum  of  the  annual  sample,  will  be 
denoted  by  X^^^;  the  minimum  value  (the  annual  low-flow)  will  be  denoted  by  X^^^, 
and  the  annual  median,  a  measure  of  central  tendency  in  the  sample,  will  be  X^n+j^/j) 
or  X(n /2)  for  even  sample  sizes. 
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The   ensuing  results   from   order  statistics   originate   from   large-sample 

approximations,  and  not  exact  solutions.    It  is  assumed  that  the  case  of  365  daily 

observations  is  sufficiently  large  to  constitute  a  large  sample.     Gibbons  (1985) 

presents  the  general  results  of  the  large-sample  approximation  to  the  mean  of  the  r"* 

order  statistic  X^^^  of  a  sample  n  from  the  continuous  distribution  Fx-  The  first  and 

second  approximations  to  the  mean,  by  rank  are 


First  Approximation:  /i, .  =  F^  (h(r)) 

Second  Approximation:  /x. .  =  Fx  (h(r))  - 

(g(r)/2!)fi(Fx\h(r)))fx(Fx>(r)))- 


(1) 


where 


h(r)  =  1  -  J- 
n+1 

,  s.  r(n-r+l) 

g(r)  =        '^  ^ 


(2) 


(n+l)2(n+2) 

and  Fx  is  the  cumulative  distribution  function,  Fx'^  is  the  inverse  distribution 
function,  fx  is  the  density  function,  and  fx'  is  the  first  derivative  of  the  density 
function.  The  second  approximation  provides  greater  accuracy  than  the  first  alone 
(Gibbons,  1985). 

There  are  obvious  similarities  between  the  methodology  of  Gibbons  (1985) 
and  the  construction  of  the  calendar-year  flow-duration  curve.  Estimates  of  ^(r).  the 
mean  by  rank,  from  order  statistics  theory  should  enable  the  construction  of  a  flow- 
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duration  curve,  once  the  underlying  frequency  characteristics  of  the  population  are 

known. 

Parent  Distribution 

The  parent  distribution  is  the  marginal  distribution  of  the  aggregate  of  all  the 
observed  discharge  records.  The  correct  choice  of  a  distribution  function  to 
represent  the  parent  is  critical,  as  it  governs  both  the  general  form  or  shape  of  the 
flow-duration  curve  and  the  actual  values  of  the  curve.  Generally  the  distribution  of 
daily  discharge  is  similar  to  those  of  many  seasonal  series  and  is  both  highly  skewed 
and  peaked.  This  shape  is  suggestive  of  a  lognormal,  gamma  or  a  generalized 
extreme  value  distribution. 
Lognormal  Distributions 

The  choice  of  a  lognormal  frequency  distribution  for  daily  discharge  is  not 
uncommon  in  the  literature  (e.g.,  Ashmore  &  Day,  1988).  There  are  two  forms  of 
the  lognormal  distribution,  two  parameter  and  three  parameter.  In  the  real  data 
space,  the  distribution  is  positively  skewed  and  bounded  by  zero  on  the  left,  which 
are  two  characteristics  that  make  this  distribution  a  good  choice  for  modelling  of 
streamflow  (Haan,  1977).  In  the  log  transform  domain,  the  data  are  normally 
distributed.  The  distribution  is  derived  by  the  transformation  y  =  log(x),  and 
moment  estimators  (Table  1)  are  used  to  find  the  first  two  parameters  giving  a 
density  function  of 
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(logx-M , 


f(x)= L^e 


.         ^1 
1       A    K    ]  (3) 


xa,. 


and  a  distribution  function  of 


y  -^ 


F(Y<y)=  r — i — e  ^  ^'  Jdt 


(4) 


<a  JItT 


The  three-parameter  form  of  the  distribution  is  similar,  except  for  a  non-zero 
lower  bound  which  can  be  estimated  using  a  variety  of  techniques  (Haan,  1977;  Kite, 
1977).  Generally  it  is  estimated  first  in  real  space,  then  the  moment  estimators  are 
used  on  the  transformed  variate  y  =  log(x-a).  The  three-parameter  form  has  a 
density  function  of 

_f(log(x-a)-My) 

e 
(x-a)a^V^ 

and  a  distribution  function  of 


^/  X             1  \       ^\       \  (5) 

f(x)= e  y         y       )  ^  ^ 


_l_e  1  ^^^  I 


F(Y<y)=J_l_e  I  ""^^  kt  ^^^ 
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Gamma  Distributions 

Another  positively  skewed  distribution  is  the  gamma.  There  are  two  forms: 
a  two-parameter  and  a  three-parameter  form.  The  two-parameter  distribution  is 
described  by  shape,  rj,  and  scale,  A,  parameters  which  may  be  estimated  from  the 
first  two  moments  (Table  1)  (Haan,  1977;  Kite,  1977).  The  density  function  of  the 
two-parameter  form  is 


f(x)= 


X^x^-^e"^" 


r(r,) 
where,  ^=x/s^  and  f)=x^/s^ 


(7) 


The  distribution  function  for  the  two-parameter  form  is 


F(X<x)=  r  V  ^     dt 


rin) 


(8) 


The  optional  third  parameter  is  the  location  parameter,  y  derived  from  the 
coefficient  of  skewness  yielding  a  density  function  (Kite,  1977) 


f(x)=_^i^r"V{^} 


(9) 


ar(/3)[  a 

where,  ^=(2/p^)\  a=sj^^    ,  y=x-sj^ 

..H  «    ni-i    6.51    20.2]    ^2(1.48    6.77 
and  /3,=C    1+ + +Q   + 

[       "        n^  J      H   n        n\ 
Creating  a  transformed  variate  y  =  ((x-Y)/a),  the  distribution  function  simplifies  to  a 
simple  one-parameter  gamma  form 
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f^-U-t 


F(Y<y)=fi:2i_:dt  (10) 

Generalized  Extreme  Value  Distribution 

Derived  by  Jenkinson  (1954)  to  include  the  three  classical  extreme  value 
distributions,  all  forms  are  positively  skewed  with  the  primary  difference  in  the  tails 
of  the  distributions.  The  Type  I  distribution  behaves  as  if  it  were  unbounded  in 
either  direction,  the  Type  II  behaves  as  if  it  were  bounded  in  the  lower  direction  (for 
maxima),  and  Type  III  is  bounded  in  the  upper  direction  (for  maxima).  The  value 
of  the  shape  parameter,  k,  differentiates  between  the  extreme  value  types  described. 
For  k  =  0,  the  distribution  has  the  Type  I  form.  For  k<0,  the  distribution  takes  on 
the  Type  II  form,  and  for  k>0  it  has  the  Type  III  form.  The  density  functions  are 
given  as 

f(x)=  exp[-{l-k(x-0/a}^/''](-(l/k{l-(k(x-0)/aK^/''-^)(-k/a)       k#0 
=  exp[-exp{-(x-0/a}][-exp(-(x-0/a)](-l/a)  k=0 

The  distribution  functions  are 

F(X<x)  =  cxp[-{l-k{x-i)/a}'/^]  k#0 

=  exp[-exp{-(x-$)/a}]  k=0 

Estimates  of  the  parameters  of  this  distribution  can  be  obtained  via  probability 
weighted  moments  (Hosking  et  al.,  1985) 
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k  =  7.8590c  +  2.9554c  2 
^  2bi-bo  _  log2 

3b2-bo       log3 

(2brbo)k 


(13) 


a  = 


r(l+k)(l-2-'') 
^  =  bo  +  d{r(l+k)-l}/k 


'        U        '  (14) 

where,  p-  = ,  and  r=  0,1,2 

Lognormal  Mixture  Distribution 

An  alternative  to  a  single-component  distribution  for  daily  discharge  is  to  use 
a  mixture  of  two  components  as  the  parent.  The  ability  of  a  mixture  distribution  to 
represent  more  complex  frequency  distributions  is  not  surprising,  given  the  necessity 
of  estimating  five  parameters.  The  possibility  that  daily  streamflows  are  drawn  from 
mixtures  of  streamflow  generating  populations,  for  example,  snowmelt,  rainfall  or 
storage  depletion  (Hirschboeck,  1985;  Waylen,  1985b)  is  however  more  appealing 
physically. 

A  potentially  suitable  parent  distribution  is  the  two-component,  two- 
parameter,  lognormal  mixture  distribution  (Leytham,  1984).  Again,  the  distribution 
is  positively  skewed  and  bounded  by  zero  on  the  left;  the  addition  of  the  second 
component  has  the  benefit  of  extending  the  tail  of  the  distribution.  The  resulting 
density  function  for  the  parent  of  daily  discharge  is 
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f(y)=P0(y|Mi,a?)  +  (l-p)0(y|M2,^2) 


(15) 


where  0  is  the  two-parameter  lognormal  density  function  (Equation  3)  and  p  is  the 
mixing  proportion  of  the  first  component.  The  distribution  function  is 

F(Y<y)=p*  (ylMj.aJ)  +  (l-p)»  (yln^^  (16) 

where  *  is  the  two-parameter  lognormal  distribution  function  (Equation  4).  A 
primary  advantage  of  this  particular  distribution  is  that  estimation  of  the  parameters 
is  not  dependent  on  an  a  priori  classification  scheme.  Leytham  (1984)  provides  a 
complete  maximum  likelihood  estimation  procedure  using  the  expectation- 
maximization  algorithm  of  Dempster  et  al.  (1977). 


,M 


^^  =  [Pr^(yj0^^^)] 

i=l,...,n   j=l,2 
j=1.2 


1-1 


Y.v^K(y^<p''') 


.(vl) 


,(vl) 


i-1 

i=i.2 


(V) 


EwS 


11/2 


(y)\2 


Ewjr(yrMr) 


i-1 


(V) 


Ewfi 


i-1 


(17) 


The  transformed  variate  is  y=hi(x),  /ij  and  Mz  are  the  means  of  the  respective 
processes,  and  a^^  and  a^j  are  the  variances  of  the  individual  components.  Given  the 
structure  of  the  distribution,  the  first  component  always  has  the  smaller  mean. 
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Table  1.  Common  Moment  Estimators 
Mean,  m,  =-Y>^i 


2         1 


Variance,  a^  = — -J^Cx-Mx)^ 

n 

Skewness,  v,  = y^(X:-Mv)^ 

^     (n-l)(n-2)f^^  '      *^ 


Y 
Coefficient  of  skewness,  C,  =— 

s  3 

a 
Coefficient  of  variation,  C    =— i 


Spatial  Analysis 

Regionalization 

The  current  use  of  the  term  "regionalization",  in  hydrology,  refers  to  the 
"substitution  of  space  for  time"  in  improving  the  estimation  of  the  probabilities  of 
extreme  floods  (National  Research  Council,  1988).  Regionalization  is  a  form  of  the 
classification  of  data  which  have  been  collected  for  spatial  units  into  measurably 
homogeneous  regions.  Regionalization  based  on  maps  of  homogeneity  statistics  such 
as  the  coefficient  of  variation  provides  a  subjective  interpretation  of  the  spatial 
patterns,  and  the  delimitation  of  regions  of  a  continuous  variable  becomes  an  art. 
Cluster  analysis  provides  an  objective   approach   to  regionalization  based  on 
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parameters  of  the  parent  distribution.  Groups  of  rivers  whose  parameters  are  similar 

compose  hydrologic  regions. 

Cluster  analysis  is  a  generic  name  given  to  a  set  of  various  procedures  used 

to  create  a  classification  scheme  (Aldenderfer  &  Blasfield,  1984).  As  such  there  are 

a  few  potential  problems:  1)  most  methods  are  empirically  based,  and  not  supported 

by  theory,  2)  different  methods  generate  different  resultant  clusters  from  the  same 

data  set,  and  3)  although  used  to  discover  structure  in  data,  their  optimal  use 

requires  recognition  of  physically  meaningful  groupings,  as  opposed  to  those  which 

are  merely  an  artifact  of  the  numerical  procedures  (Aldenderfer  &  Blasfield,  1984). 

Spatial  Estimation 

Spatial  estimation  of  parameter  values  at  locations  with  absent  or  insufficient 
observations  is  a  problem  in  applied  geography.  The  construction  of  maps  of 
parameter  values  is  complicated  by  the  need  to  interpolate  between  observations  in 
order  to  facilitate  prediction  and  may  be  accomplished  through  a  variety  of 
algorithms.  Figure  14  shows  a  sample  grid  from  a  mapping  program  such  as 
SURFER  (Golden  Software  Inc.,  1989).  The  program  starts  with  a  series  of  random 
observations  as  marked  on  the  map,  the  grid  is  superimposed  and  the  program 
estimates  the  values  of  the  variable  at  each  grid  intersection.  Usually,  the  grid  cell 
is  used  as  the  search  radius  around  the  intersection,  and  observations  within  that 
radius  are  averaged  to  estimate  the  value  for  a  point;  when  no  observations  are 
available  nearby,  the  estimation  of  the  value  at  a  grid  intersection  is  accomplished 
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with  the  gridding  algorithm.   Golden  Software's  SURFER  program  provides  three 

techniques  for  mapping: 

1)  The  inverse  distance  algorithm  simply  weights  each  data  point's  influence 
on  another  by  the  inverse  of  the  distance  raised  to  a  specified  power  and  sums  over 
the  requisite  number  of  nearest  neighbors;  this  tends  to  be  very  fast  computationally, 
but  lacks  analytical  sophistication. 

2)  The  minimum  curvature  method  fixes  the  value  of  a  grid  point  with  an 
observation,  if  present,  and  then  repeatedly  smooths  the  surface  with  a  mathematical 
equation  until  a  specified  minimum  error  is  achieved.  This  technique  is  very 
attractive  because  it  ensures  that  observations  are  honestly  reflected  in  the  resulting 
map. 

3)  Kriging  uses  regional  variable  theory  to  provide  a  minimum  variance 
unbiased  linear  estimator  of  the  mean  characteristic  of  a  grid  cell  for  a  given 
geometry  (Golden  Software  Inc.,  1989;  Journel  &  Huijbregts,  1978).  In  practice 
however,  the  results  of  kriging  depend  on  the  skill  of  the  person  using  the  software 
and  their  famiharity  with  the  technique. 

Inverse  distance  procedures  tend  to  be  very  sensitive  to  the  search  procedure 
employed  to  include  data  points  in  the  estimation.  Expanding  the  search  to 
quadrants  or  octants  introduces  a  much  higher  degree  of  smoothing  to  the  result.  A 
nearest  neighbor  search  on  the  other  hand  may  generate  unrealistic  trends  or  slopes 
in  areas  of  limited  data,  and  translate  poorly  to  the  estimation  of  grid  points  (Golden 
Software  Inc.,  1989). 
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While  both  the  minimum  curvature  and  kriging  techniques  are  said  to  be  more 
accurate  because  they  project  trends  in  the  data  (Golden  Software  Inc.,  1989),  the 
opposite  may  in  fact  be  the  case.  Spurious  highs  or  lows  in  areas  of  limited  data  may 
result  if  trends  are  projected  from  areas  of  tightly  clustered  control  points  (Sampson, 
1978).  Kriging  also  requires  prior  determination  of  the  semivariogram,  a  function 
relating  covariance  between  points  to  the  distance  between  the  points.  The 
effectiveness  of  kriging  depends  on  the  proper  selection  of  parameters,  including  the 
semivariogram  (Sampson,  1978).  SURFER  has  a  linear  semivariogram  built-in;  this 
particular  choice  is  extremely  poor  as  it  implies  that  variance  increases  without 
bounds  with  increasing  distance  (Sampson,  1978).  Additionally,  the  choice  of 
underlying  drift  is  imperative  as  all  estimates  depend  on  this  trend;  again  this 
requires  prior  investigation. 

Concluding  Remarks 

Application  of  the  order  statistic-based  model  begins  with  identification  of  the 
parent  distribution  and  the  subsequent  estimation  of  it's  parameters.  Each  of  the 
distributions  outlined  in  this  chapter  have  the  correct  shape  for  the  distribution  of 
daily  streamflow.  The  final  choice  for  the  parent  distribution  can  be  made  on  the 
basis  of  goodness-of-fit  tests  and  the  physical  reasonableness  of  the  distribution. 

Cluster  analysis  allows  objective  identification  of  homogeneous  regions  in  the 
parameters  of  the  parent  distribution.  Results  of  this  regionalization  may  be 
interpreted  using  the  known  underlying  physical  geography  of  the  study  area,  and 
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compared  to  other  regionalization  efforts  in  the  same  area,  based  on  such  variables 

as  the  mean  annual  flood. 

Using  the  minimum  curvature  technique,  parameters  of  the  parent  distribution 

of  streamflow  for  each  basin  may  be  mapped.  Specification  of  the  degree  of  error 

allowed  in  estimation  of  the  surface,  and  the  requirement  that  observations  be 

honestly  presented  in  the  final  gridded  surface  make  this  the  best  choice  for  the  study 

area  with  it's  high  degree  of  underlying  variability.  The  resultant  maps  provide  for 

the  estimation  of  the  parameters  of  the  parent  distribution  at  any  location. 


CHAPTERS 
RESULTS 


The  statistical  model  of  flow-duration  curves  is  applied  to  all  rivers  in  the 
study  area,  a  subset  of  which  are  chosen  and  discussed  in  detail.  Results  are 
presented  as  a  series  of  maps  of  each  of  the  parameters  of  the  chosen  parent 
distribution.  The  physical  basis  of  the  observed  variation  of  the  parameters  are 
discussed,  and  the  results  of  cluster  analysis  on  the  parameters  of  the  parent 
distribution  are  presented.  A  set  of  predictions  are  made  from  the  maps,  and 
compared  to  limited  historic  records  at  these  sites. 

Four  rivers  possessing  widely  differing  flow  regimes  are  selected  to  illustrate 
the  performance  of  the  model  (Figure  15).  The  Sumas  River's  flow  regime  is 
dominated  by  flows  generated  from  winter  rains  and  to  a  lesser  extent  smaller 
summer  convective  storms.  The  basin,  situated  entirely  on  the  Eraser  Lowlands,  is 
relatively  small  with  an  area  of  149  km^  and  possesses  little  lake  storage.  The  regime 
of  Slesse  Creek  possesses  two  periods  of  high  flow,  one  during  spring  snowmelt  and 
the  other  resulting  from  large  winter  rains.  It  is  a  small  (162  km^)  mountainous  basin 
on  the  windward  side  of  the  Cascade  Mountains,  with  little  natural  storage  in  the 
form  of  lakes  and  lowlands.  The  Kitimat  River  drains  a  mountainous  basin  of  1,990 
km^  in  the  Coast  Mountains.  High  flows  are  generated  by  spring  snowmelt,  summer 
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Figure  15.  Study  Rivers,  Gage  Locations  and  Average  Annual  Hydrographs 
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glacier  melt  and  fall  rains;  there  is  little  lake  storage  within  the  basin.  The  last  river, 

the  Nazko,  located  on  the  Nechako  Plateau,  characterizes  the  dry  interior  regimes. 

There  is  a  single  high  flow  period  of  spring  snowmelt,  and  the  rest  of  the  year  is 

generally  dry  with  a  large  winter  draw-down  following  the  fall  freeze-up.  The  Nazko 

is  the  largest  of  the  four  basins  studied,  with  an  area  of  3,240  km^. 

Statistical  Model 

The  distributions  of  observed  flows  (Figure  16)  in  the  four  sample  rivers  are 
highly  skewed  and  peaked,  with  particularly  accentuated  upper  tails.  The  discharge 
classes  used  in  Figure  16  are  derived  by  dividing  the  observed  range  of  the  data  into 
twenty  equal  classes.  The  probability  distributions  listed  in  Chapter  4  are  estimated 
and  fit  to  the  observed  data  at  all  one  hundred  and  eighty  two  stations.  The  results 
of  the  application  of  the  standard  Kolmogorov-Smirnov  and  Chi-squared  goodness-of- 
fit  tests  to  the  cumulative  and  density  forms,  respectively,  of  the  four  sample  records 
are  summarized  in  Tables  2  and  3.  Not  surprisingly,  the  goodness-of-fit  statistics  are 
generally  better  using  a  2-component  lognormal  mixture  distribution  as  the  parent. 
The  distribution  is  capable  of  representing  both  the  small  inflation  of  the  frequency 
distribution  at  the  high-discharge  end  of  the  curve  and  the  relative  peakedness  of  the 
lower-discharge  end  in  all  four  diverse  hydrologic  environments  (Figure  16). 

Assuming  the  mixed  lognormal  distribution  to  be  an  adequate  parent, 
expected  flow-duration  curves  are  generated  and  visually  compared  to  the  observed 
curves  in  Figure  17.  As  the  flow-duration  curve  represents  the  mean  discharge  at  a 
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Table  2.  Parameter  Estimates  for  Parent  Distributions 
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Distribution 

Two- 
Parameter 
Lognormal 
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Parameter 
Lognormal 
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Table  3.  Mixture  Distribution  Parameters  for  Parent  Distribution 
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Samp 
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K-S 
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Statistic 

Statistic 

Sumas  R. 

11680 

P 

-4.94953 
0.490131 
0.368228 
-3.80063 
0.816581 

0.04264 

187.383 

Nazko  R. 

7300 

P 

^2 

^y2 

-7.98199 
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-6.66719 
1.08041 
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Kitimat  R. 
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Si 
P 

^2 
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P 
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-3.72620 
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•  in  log  space 
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particular  rank  it  is  reasonable  to  perform  one-sample  t-tests  of  significant  difference 

between  observed  and  predicted  data.    The  necessary  assumption  of  normality  is 

theoretically  supportable  at  intermediate  and  central  ranks,  but  not  at  the  extreme 

ranks  (Leadbetter  et  al.,  1983).  It  is  only  at  the  latter  ranks  that  the  observed  data 

and  fitted  models  can  be  considered  significantly  different  at  the  0.05  level. 

Regionalization  and  Prediction 

Results 

The  mixed  lognormal  distribution  parent  is  fit  to  the  182  stations  in  Figure  18 
and  the  estimated  parameters  are  listed  in  Appendix  B.  Parameters  are  estimated 
in  log  space  and  converted  to  real  space  by  the  following  transformations  (Kite,  1977) 


x  =  exp[y+Sy/2] 


Sx  =  ?[exp(sj)-l] 


for  each  of  the  two  sub-populations  in  the  mixture.  Maps  of  each  of  the  five 
parameters  in  real  space  (Figure  19-23),  except  for  the  mixing  proportion  (Figure  21) 
reflect  the  importance  of  annual  precipitation  (Figure  5).  In  all  cases,  the  Coast  and 
Columbia  mountain  areas  are  prominent  north  west/south  east  features,  separated 
by  the  interior  plateau. 

Larger  values  for  both  the  mean  (Figure  19)  and  standard  deviation  (Figure 
20)  of  the  smaller  sub-population  are  observed  along  the  coast.  The  mean  is  greater 
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Figure  18.  Hydrometric  Network 
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Figure  19.  Mean  of  the  First  Component  (Contour  Interval  =  0.006  m^s'^km^) 
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Figure  20.  Standard  Deviation  of  the  First  Component  (C.I.  =  0.004  m^s'^km^) 
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than  0.036  m^s'^km^  and  the  standard  deviation  above  0.02  mV^km^.  Physically  these 
correspond  to  summer  rains  in  littoral  locations.  Inland,  the  smaller  process  is  draw- 
down of  storage  within  a  basin  during  winter.  The  leeward  side  of  the  Coast 
Mountains  and  Columbia  Mountains  are  higher  in  both  mean  and  standard  deviation 
than  the  dry  interior  (mean  and  standard  deviation  less  than  0.006  and  0.004  m^s'^km^ 
respectively),  Alberta  Plateau  and  the  Rocky  Mountain  trench. 

Two  sub-parallel  ridges  of  high  values  corresponding  to  mountain  ranges 
appear  in  values  of  the  mean  (above  0.1  mV^km^)(Figure  22)  and  standard  deviation 
(0.1  mV^km^)(Figure  23)  of  the  larger  process.  These  are  generally  an  order  of 
magnitude  larger  than  the  smaller  component.  Along  the  coast,  the  larger  process 
is  generated  by  winter  rains  and  rain  on  snow.  In  the  interior,  the  larger  process 
results  from  spring  snowmelt  and  possibly  summer  glacier  melt.  The  dry  interior  and 
Alberta  Plateau  and  Rocky  Mountain  trench  all  stand  out  clearly.  The  steep  gradient 
of  processes  from  the  windward  side  of  the  Coast  Mountains  to  the  dry  interior,  and 
to  a  lesser  extent  for  the  Columbia  Mountains  is  a  dominant  feature  of  all  these 
maps. 

Figure  21  depicts  the  mixing  proportion  of  the  smaller  process  and  is  more 
difficult  to  interpret,  partially  because  the  physical  processes  correspond  to  the 
smaller  sub-population  varies  across  the  study  area.  In  general,  the  proportion  of 
time  that  the  smaller  process  is  active  increases  to  the  south  and  east  (from  0.26  to 
0.42  and  0.39  to  0.45,  respectively).  The  larger  this  parameter,  the  more  restricted 
is  the  period  of  high  flows. 
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Figure  21.  Mixing  Proportion  (C.I.  =  0.03) 
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Figure  22.  Mean  of  the  Second  Component  (C.I.  =  0.04  mV^kmO 


70 


Figure  23.  Standard  Deviation  of  the  Second  Component  (C.I.  =  0.05  mV^km^) 
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Spatial  Analysis 

Figures  19  through  23  suggest  certain  regions  of  similar  hydrologic  behavior 
as  indicated  by  the  parameters  of  the  mixture  distributions.  Objective  groupings  of 
basins  possessing  similar  sets  of  parameters  are  identified  using  the  cluster  analysis 
procedures  within  SAS  (SAS  Institute  Inc.,  1988).  Two-stage  density  linkage  cluster 
analysis  was  used  to  identify  seven  relatively  homogeneous  clusters.  This  particular 
technique  of  analysis  is  chosen  because  the  algorithm  ensures  that  all  points  are 
assigned  to  modal  clusters  before  modal  clusters  themselves  are  allowed  to  join,  and 
few  constraints  are  placed  on  the  data.  The  derived  cluster  membership  is  depicted 
in  Figure  24.  The  average  flow-duration  curves  for  each  cluster  are  constructed  in 
Figure  25,  and  the  parameters  in  each  cluster  are  summarized  in  Table  4. 

Clusters  II  and  III  are  representative  of  a  general  type  of  flow-duration  curve. 
The  means  and  standard  deviations  of  both  components  are  small  (less  than  0.005 
and  0.002,  respectively),  and  the  mixing  proportion  of  the  smaller  process  is  relatively 
large  (0.41  for  II,  and  0.46  for  III).  These  flow-duration  curves  are  typical  of 
snowmelt  dominated  flow  regimes.  Both  clusters  have  the  longest  period  of  monthly 
minimum  temperatures  below  freezing,  averaging  6.3  months.  Cluster  III  has  the 
lowest  total  annual  precipitation  of  all  clusters  with  less  than  500mm. 

Cluster  ni  is  located  geographically  in  the  south  central  part  of  the  province 
in  the  driest  area.  This  is  reflected  by  the  consistently  small  values  of  discharge  in 
the  flow-duration  curve.    The  regime  represented  is  spring  snowmelt  and  storage 


Figure  24.  Cluster  Membership  and  Regions 
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Figure  25.  Average  Flow-Duration  Curves  by  Cluster  Membership 


74 


Table  4.  Cluster  Summary  of  Parameters  of  the  Mixture  Distribution  Parent 


Number 

Standard 

Cluster 

of  Gages 

Parameter 

Mean 

Deviation 

I 

22 

Mxl 

0.0041102 

0.0015714 

''xl 

0.0016596 

0.0004972 

p 

0.4435570 

0.0057512 

Mx2 

0.0258839 

0.0058879 

<'x2 

0.0365977 

0.0153759 

n 

79 

Mxl 

0.0054826 

0.0031371 

^xl 

0.0026180 

0.0015016 

P 

0.4116332 

0.0181721 

Mx2 

0.0319591 

0.0165782 

«^x2 

0.0363247 

0.0222834 

ni 

17 

Mxl 

0.0023026 

0.0012310 

«^xl 

0.0009005 

0.0003964 

P 

0.4619863 

0.0072412 

Mx2 

0.0155354 

0.0063279 

<^x2 

0.0245944 

0.0110716 

IV 

32 

Mxl 

0.0119054 

0.0041608 

^xl 

0.0065709 

0.0026482 

P 

0.4065077 

0.0202680 

Mx2 

0.0849624 

0.0188469 

«^x2 

0.1050167 

0.0364385 

V 

11 

Mxl 

0.0056481 

0.0015614 

t^xl 

0.0024117 

0.0005285 

P 

0.4565075 

0.0080696 

Mx2 

0.0417693 

0.0061117 

o^ 

0.0658142 

0.0159323 

VI 

8 

Mxi 

0.0315927 

0.0105845 

t^xl 

0.0127372 

0.0051053 

P 

0.3905571 

0.0161944 

Mx2 

0.0976899 

0.0417523 

<^x2 

0.0731671 

0.0457388 

VII 

13 

Mxl 

0.0202548 

0.0059838 

^xl 

0.0157667 

0.0037205 

P 

0.3461062 

0.0136073 

Mx2 

0.1094643 

0.0365555 

O^ 

0.1375107 

0.0691916 
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depletion  through  the  remainder  of  the  year.  Cluster  II  is  also  a  snowmelt-type  curve 
with  a  slightly  larger  spring  snowmelt  and  extensive  period  of  storage  draw-down. 
The  second  cluster  is  spread  out  through  all  of  the  dry  interior  and  fans  out  into  most 
of  the  northern  extent  of  the  province. 

Clusters  I  and  V  have  very  large  second  component  means  (0.03  and  0.04),  but 
the  mixing  proportion  of  the  smaller  process  (0.44-0.45)  indicates  that  the  flow 
regime  represented  is  dominated  by  storage  depletion.  Annual  precipitation  is  higher 
in  these  clusters,  averaging  600mm  and  the  winter  appears  to  be  shorter,  averaging 
5.8  months  v^dth  monthly  minimum  temperatures  below  freezing. 

Cluster  V  is  a  transition  cluster:  the  larger  process  is  greater  than  the  two 
previous  mentioned  clusters,  but  the  smaller  process  is  active  longer  in  the  year. 
Cluster  I  is  very  similar  in  the  low-discharge  end  of  the  curve  to  the  predominately 
winter  draw-down  processes  of  the  previous  clusters.  This  cluster  represents  a 
transitional  type,  from  clusters  with  a  very  small  winter  process  which,  acts  for  a  long 
period  of  the  year,  to  clusters  with  a  very  large  process  which  acts  for  a  short  period 
of  the  year.  As  well,  the  flattening  at  the  top  of  the  curve  indicates  the  presence  of 
storage  acting  on  the  highest  flows. 

Clusters  IV,  VI  and  VII  are  dominated  by  very  large  second  processes  (means 
of  0.09,  0.1  and  0.11)  and  larger  first  components  than  the  two  previous  groups 
(means  0.01,  0.03  and  0.02).  Differences  between  the  flow-duration  curves  of  clusters 
within  this  group  are  represented  more  by  the  importance  and  size  of  the  smaller 
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process.    These  clusters  have  an  average  total  precipitation  of  around  1700mm, 

almost  entirely  as  rainfall,  and  the  shortest  period  of  freezing  temperatures. 

Cluster  rV  is  found  along  the  top  of  the  Columbia  Mountains  and  the  leeward 

side  of  the  Coast  Mountains.   These  rivers  have  a  very  consistent  decrease  in  the 

curve  through  all  ranks,  indicative  of  a  consistent  variability  through  the  year, 

perhaps  reflecting  the  presence  of  glaciers  and  large  semi-permanent  snow  packs. 

Cluster  Vn  has  the  largest  process  in  the  high  discharge  end  of  the  curve.   Rivers 

that  make  up  this  cluster  appear  to  be  found  along  the  tops  of  the  Coast  Mountains. 

Again  the  relatively  slow  decline  of  the  curve  to  the  low-discharge  end  shows  the 

presence  of  semi-permanent  storage  such  as  snow  packs.  Cluster  VI  has  the  highest 

low-flows  observed  in  all  clusters,  as  well  as  the  most  persistent  inflation  of  the  curve. 

These  rivers  are  located  low  along  the  windward  side  of  the  Coast  Mountains  and, 

just  above  and  interior  to,  the  Fraser  River  delta.   The  differences  between  these 

three  mountainous  clusters  is  best  reflected  by  average  climatological  conditions: 

Cluster  rV  has  an  average  total  precipitation  of  1200mm  and  approximately  5.3 

months  of  monthly  minimum  temperatures  below  freezing  and  the  other  two  clusters 

having  precipitation  totals  averaging  2000mm  and  approximately  2.3  months  with 

minimum  temperatures  below  0°C. 
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Summary 

The  model  for  flow-duration  curves  yields  parameters  which  are  physically 
interpretable  and  appear  reasonable,  with  regard  to  the  hydrologic  processes  active 
within  the  study  area.  The  resulting  distribution  for  the  parent  of  the  generating 
processes  of  streamflow  is  capable  of  representing  the  flow-duration  curves  for  a  wide 
variety  of  hydrologic  environments.  Analysis  of  the  spatial  distribution  of  the 
parameters  of  the  parent  distribution  shows  spatially  contiguous  regions  and 
therefore  it  is  reasonable  to  assume  that  predictions  are  possible. 

Predictions 

Records  on  four  rivers,  located  in  differing  physical  enviroimients  and 
previously  unused  in  the  analysis,  are  selected  as  test  stations  for  the  spatial 
estimation  of  the  parameters  of  the  parent  distribution.  The  locations  and  annual 
hydrographs,  based  on  the  available  record  are  shown  in  Figure  26.  Summary  of  the 
location  and  record  lengths  of  these  four  stations  may  be  found  in  Appendix  C.  The 
regime  of  the  Atnarko  River,  on  the  leeward  side  of  the  Coast  Mountains,  is 
dominated  by  spring  snowmelt  and  rare  fall  precipitation.  The  West  Road  River  lies 
entirely  within  the  dry  interior,  it's  regime  dominated  by  small  spring  snowmelt  peak, 
and  long  recession  through  summer  and  winter.  The  Mamquam  River  drains  the 
windward  side  of  the  south  west  region  of  the  Coast  Mountains  and  has  high  runoff 
from  spring  snowmelt,  summer  glacier  melt  and  both  fall  and  winter  rains.  Grave 
Creek  is  a  snowmelt  dominated  flow  regime  in  the  Rocky  Mountains. 
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Figure  26.  Test  Rivers,  Gage  Locations  and  Average  Annual  Hydrographs 
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The  five  parameters  of  the  parent  mixture  distribution  are  estimated  by 

interpolation  from  more  detailed  versions  of  Figures  19  -  23.  These  numbers  are 

converted  to  log  space  using  the  following  equations  (Kite,  1977) 

y=  l/21n[x7(C^l)] 

sj  =  ln(Q^l)  (19) 

where,  C^  =  sjx 

Flow-duration  curves  in  Figure  27  are  generated  from  the  resultant  parent 
distribution  parameters  (Table  5).  There  is  an  obvious  mis-specification  for  the 
Atnarko  River.  Parameter  estimates  of  both  the  larger  and  smaller  processes  yield 
a  curve  with  the  correct  shape,  but  which  consistently  over  estimates  the  mean  flow 
at  each  rank.  Performance  on  the  remaining  three  rivers  is  better:  in  general, 
estimates  for  the  smaller  process  are  very  close  to  observed  values,  while  the  larger 
process  is  consistently  over-estimated.  Grave  Creek  and  the  Mamquam  River  have 
reasonable  estimates  for  the  mean  at  high  ranks,  but  the  larger  component  has 
parameter  estimates  which  translate  to  the  over-estimation  of  the  mean  for  low 
ranks.  However,  the  shape  of  the  curves  match  that  of  the  observed  data.  The  West 
Road  River  shows  over-estimation  of  the  parameters  of  both  components  and 
consistent  overestimation  of  the  mean  by  rank,  for  all  ranks.  Assuming  a  normal 
distribution  for  all  ranks,  t-tests  are  used  with  mixed  results.  There  are  regions  of 
significant  differences  between  observed  and  predicted  means  at  the  0.05  significance 
level  for  all  four  rivers.     The  highest  ranks  (low  discharges)  for  all  four  are 
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Table  5.  Map  Predictions  of  Parameters  of  the  Mixture  Distribution  Parent 


Interpolated 

Estimated 

Percent 

Station 

Parameter 

Values 

Values 

Error 

West  Road 

Mxi 

0.00118 

0.001047 

12.7 

River 

Oxi 

0.00041 

0.000312 

31.4 

P 

0.453 

0.458530 

-1.2 

Mx2 

0.005 

0.004064 

23.0 

o,a 

0.005 

0.003953 

26.5 

Atnarko 

Mxi 

0.00648 

0.004462 

45.2 

River 

Oxl 

0.0029 

0.001714 

69.2 

P 

0.4133 

0.432598 

-4.5 

Mx2 

0.0318 

0.018045 

76.2 

°xl 

0.0288 

0.017054 

68.9 

Mamquam 

/^xl 

0.0321 

0.035405 

-9.3 

River 

^xl 

0.01218 

0.012557 

-3.0 

p 

0.3978 

0.404934 

-1.8 

Mx2 

0.1063 

0.105372 

0.9 

^x2 

0.0831 

0.076681 

8.4 

Grave 

/^xl 

0.00409 

0.004872 

-16.1 

Creek 

^xl 

0.00154 

0.001505 

2.3 

P 

0.4552 

0.463745 

-1.8 

Mx2 

0.0225 

0.019906 

13.0 

<^x2 

0.0303 

0.021319 

42.1 
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significantly  different,  despite  their  apparent  visual  fit.  The  remainder  of  all  curves 
show  fragmented  areas  where  the  t-tests  indicate  significant  differences.  The 
Mamquam  River  has  200  ranks  with  significant  differences,  predominantly  at  the 
lowest  discharges.  The  West  Road  River  has  the  overall  best  fit  with  70  ranks  at 
intermediate  and  extreme  low  discharge  ranks  showing  significant  differences.  Grave 
Creek  has  the  bottom  150  ranks  showing  as  significantly  different,  and  the  Atnarko 
has  all  ranks  except  for  the  lowest  discharges,  as  being  significantly  different. 

Conclusions 

The  statistical  model  of  flow-duration  curves  performs  very  well  for  rivers 
where  the  parameters  of  the  parent  distribution  may  be  estimated  directly  from  the 
streamflow  data.  The  use  of  the  maps  as  predictive  tools  is  restricted  largely  because 
of  very  steep  gradients  of  parameters  across  the  physiography  of  the  study  area.  The 
parameters  of  the  larger  component  are  sensitive  to  over-generalization  in  the 
process  of  creation  of  the  maps.  The  physical  nature  of  this  process  changes,  as  well 
as  the  magnitude  of  the  parameters  over  the  study  area.  While  regions  of  similar 
values  for  parameters  of  the  parent  distribution  can  be  identified  objectively,  these 
regions  cannot  be  used  to  improve  the  spatial  estimation  of  the  parameters.  Within 
each  cluster  the  physiography  and  climate  are  controlling  the  variability  in  the 
parameters  in  ways  that  caimot  be  modelled  quantitatively. 

The  choice  of  a  mixture  distribution  for  the  parent  of  streamflow  generating 
processes  is  very  good,  as  it  has  been  shown  to  represent  the  observed  frequency 
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characteristics  of  the  data,  and  the  parameters  of  the  distribution  can  be  directly 
interpreted  and  related  to  the  physical  processes  active  within  a  basin.  This  physical 
reasonableness  is  a  major  advantage  not  shared  by  the  other  distributions  identified 
as  potential  parents.  Further  improvement  in  the  fit  of  the  model  may  be  obtained 
with  further  investigation  of  the  parent  distribution  and  the  inclusion  of  additional 
processes  or  the  choice  of  a  mixture  of  different  underlying  distributions. 


CHAPTER  6 
FURTHER  IMPLICATIONS 


A  risk  estimate,  or  probability,  that  a  particular  ranked  discharge  will  be 
greater  than  some  value  in  a  given  year,  rather  than  just  the  mean  at  that  rank,  is  of 
greater  practical  value  in  hydrology.  Order  statistics  provide  a  method  by  which  the 
distribution  of  flows  at  any  rank  and  the  parameters  of  that  distribution  may  be 
estimated.  The  success  of  the  approximation  in  representing  the  mean  has  been 
demonstrated  previously.  This  chapter  presents  an  extension  of  the  order  statistics 
and  the  varying  success  of  its  application  to  the  flow-duration  curves  of  rivers  in  two 
diverse  hydrologic  environments.  Limitations  to  the  practical  application  of  the 
approach  are  discussed  in  light  of  differing  hydrologic  processes. 

Extensions  of  the  Basic  Model 

Gibbons  (1985)  presents  the  general  results  of  the  large-sample  approximation 
to  the  variance  of  the  r'th  order  statistics  X^^-^  of  a  sample  n  from  the  continuous 
distribution  Fx-  The  first  and  second  approximations  for  both  the  mean  and  variance 
are 
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First  approximation:  oj^y  g(r)f(F'^(h(r)))"^ 
Second  approximation:  0^^=  g(r)f(F-^(h(r)))-2  -  ^^^^ 

(l/4)gV)[f'(F-Hh(r)))f(F-^(h(r)))-3f 


The  approximations  are  applied  to  all  study  locations  with  very  mixed  results- 
so  much  so  that  the  technique  could  not  be  applied  universally.  Two  rivers,  the 
Sumas  and  Quesnel,  which  characterize  the  best  and  worst  hydrologic  conditions  for 
the  application  of  the  approximation,  are  shown  in  detail.  The  Quesnel  is  a  large 
basin  draining  5,000  km^  on  the  western  flanks  of  the  Columbia  Mountains  (gage 
number  64  in  Figure  18).  Its  regime  is  dominated  by  a  single  snowmelt  peak  (Figure 
28A),  while  the  large  basin  area  and  the  presence  of  two  large  lakes  immediately 
upstream  from  the  gage  act  to  produce  highly  autocorrelated  daily  flows  (r  =  0.999). 
The  Sumas  (Figure  28B),  with  a  longer  high  flow  season,  comprised  of  many 
individual  storm  peaks,  has  lower  a  autocorrelation  between  daily  flows  (r=0.90). 

Figure  29A  shows  the  observed  and  predicted  flow-duration  curves  for  the 
Sumas  River.  An  improved  estimate  for  the  predicted  mean  may  be  obtained  from 
a  nonlinear  regression  of  the  predicted  mean  against  the  observed  mean  and  rank 

^r)  =  a5Ej)r'  (21) 

to  correct  for  areas  of  mis-specification  of  the  curve.  The  lower  Figure  (29B)  shows 
the  improvement  of  the  corrected  form  in  representing  the  data,  for  all  ranks;  the 
corrected  model  curve  cannot  be  distinguished  from  the  observed  data.  This 
particular  form  of  regression  is  ideal  for  a  correction  factor  because  the  only  possible 
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Figure  30.  Quesnel  River,  Means  by  Rank.  A)  Model;  B)  Corrected  Model 
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outcomes  of  the  correction  are  positive  values.  A  multiple  linear  regression  model 
was  tried,  but  in  some  situations  the  vast  majority  of  small  observations  at  high  rank 
could  still  be  outweighed  by  very  large  discharge  observations  at  small  rank,  giving 
negative  values  for  the  mean  at  some  large  ranks. 

Table  6.  Coefficients  of  the  Regression  Correction 

E!  E  a  h  £ 

98.654  13268.302       0.9176  0.0137  0.9852 

31655.563       0.6139  -0.0996  0.7513 


Means: 
Quesnel 


Sumas 

Variances: 
Quesnel 

Sumas 


99.431 

99.152 
98.557 


21168.761       0.3033 
12362.145       0.0265 


-0.0996 

-0.1448 
-0.4667 


0.6549 
0.3276 


The  high  storage  on  the  Quesnel  River  produces  a  flattening  of  the  upper  part 
of  the  flow-duration  curve.  The  correction  equation  is  not  as  successful  in  reducing 
the  mis-specification  of  the  model  (Figure  30B)  as  in  the  case  of  the  Sumas.  The 
over-estimation  at  the  lowest  ranks  is  reduced,  but  the  under-estimation  at  the 
highest  ranks  is  exaggerated.  Table  6  presents  a  summary  of  the  regression  statistics 
to  correct  the  mean  by  rank  for  these  two  rivers.  The  R^  for  all  the  regressions  is 
above  98%;  interpretation  of  this  statistic  in  the  untransformed  data  is  difficult  as  the 
linear  relationships  are  no  longer  valid. 

Estimates  of  the  variance,  by  rank,  for  the  Sumas  river  are  shown  in  Figure 
31  A.  Variances  derived  from  Equation  20  are  smaller  than  those  observed  for 
almost  all  ranks.  Application  of  the  non-linear  regression  correction  yields  a  greatly 
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improved  fit  on  the  Sumas  (Figure  3 IB  and  Table  6)  such  that  at  no  rank  can  the 

observed  and  corrected  variances  be  said  to  be  significantly  different  at  a  level  of 

0,05  using  a  standard  x^  test.  In  making  these  tests,  it  must  be  assumed  that  the  data 

for  each  rank  are  normally  distributed;  for  the  central  and  intermediate  ranks  this 

is  theoretically  a  valid  assumption,  at  extreme  ranks  this  is  an  inappropriate 

assumption  (Leadbetter  et  al.,  1983). 

The  variance  curves  for  the  Quesnel  River  (Figure  32  A  &  B)  show  a 
profound  mis-identification  of  the  parent  distribution.  The  shape  of  the  predicted 
curve  is  wrong  and  the  predicted  variances  are  an  order  of  magnitude  too  small. 
While  it  is  possible  to  inflate  the  predicted  variances  using  the  non-linear  regression 
equation  (Figure  32B),  the  form  is  still  incorrect,  particularly  at  lower  ranks  where 
the  model  regions  of  the  curves  do  not  correspond. 

The  consistent  underestimation  of  variance  on  the  Sumas  River  is  a  function 
of  the  inflation  of  the  total  variance  in  the  time  series  resulting  from  seasonality  and 
daily  autocorrelation  of  streamflows  (Figure  28).  The  situation  is  exacerbated  in  the 
case  of  the  Quesnel  (Figure  28A),  which  is  even  more  markedly  seasonal  and 
possesses  strong  serial  correlation  (r= 0.999). 

The  two  rivers  display  very  different  characteristics  in  their  respective  time 
series.  These  differences  may  be  analyzed  in  terms  of  two  time  scales  and  the 
physical  environments:  1)  the  daily  to  monthly  patterns  of  variabihty,  2)  the  annual 
and  long  term  consistency  of  seasonality,  and  3)  the  different  physical  characteristics 
of  these  basins. 
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Figure  32.  Quesnel  River,  Variances  by  Rank.  A)  Model;  B)  Corrected  Model 
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On  a  short  term  basis,  the  Sumas  and  Quesnel  rivers  differ  in  the  degree  of 

serial  correlation;  the  higher  correlation  coefficient  of  the  Quesnel  River  (r  =  0.999) 

produces  a  smoother  annual  hydrograph  than  the  Sumas  (r  =  0.90).    The  intrinsic 

variance  in  daily  flows  does  not  map  from  the  time  domain  to  the  frequency  domain 

evenly.   The  daily  variance  in  the  Quesnel  River  is  highest  in  the  spring  snowmelt 

season  and  very  low  through  the  remainder  of  the  year.   The  Sumas  has  a  longer 

period  of  high  flow  season,  but  proportionally,  the  variance  is  smaller  during  this 

period. 

The  year-to-year  consistency  of  the  seasonality  function  controls  variance  in 
the  intermediate  ranks.  For  example,  the  Quesnel  has  a  mean  date  of  the  annual 
flood  of  day  one  hundred  and  fifty,  and  standard  deviation  of  ten  days;  the  Sumas  has 
a  mean  date  of  the  annual  flood  of  day  ten  and  a  standard  deviation  of  thirty  four 
days  (Waylen  &  Woo,  1983).  The  more  consistent  the  seasonality  function,  the 
higher  the  variance  across  all  ranks  that  may  be  expected. 

The  large  basin  area  and  lake  storage  on  the  Quesnel  River  act  in  concert  to 
create  a  large  geophysical  filter  on  the  daily  discharge  time  series.  With  a  basin  area 
one  thirtieth  of  the  Quesnel,  the  Sumas  River  shows  very  little  filtering  and 
smoothing  of  the  time  series,  therefore  basin  responses  are  rapid  to  most  inputs.  The 
nature  of  the  streamflow  generating  processes  is  also  different  for  each  basin.  The 
Sumas  River's  flows  are  generated  by  short  duration  rainfall  events,  and  their 
magnitude  and  frequency  are  greater  in  the  winter  than  the  summer.  The  Quesnel 
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River  is  dominated  by  the  spring  snowmelt,  which  in  turn  is  controlled  by  the 

migration  of  the  0°C  isotherm  up  the  flanks  of  the  Columbia  Mountains. 

Distributional  Results 

Once  the  nature  of  the  parent  distribution  of  daily  streamflow  is  known,  the 
form  of  the  probability  distribution  of  flows  at  any  rank  is  implied  by  a  combination 
of  order  statistics  and  extreme  value  theory.  The  distribution  of  individual  order 
statistics  from  an  independent  and  identically  distributed  parent  distribution  will 
belong  to  one  of  three  forms  for  extreme  orders  (Leadbetter  et  al.,  1983),  and  one 
of  two  forms  for  central  and  intermediate  orders  (Wu,  1966).  The  primary  control 
on  the  form  an  extreme  order  statistic  displays  is  the  form  of  the  parent  distribution 
(Leadbetter  et  al.,  1983;  Watts  et  al.,  1982).  Individual  distributions  at  each  set  of 
ranks  are  examined  in  more  detail  below. 

All  results  originate  from  asymptotic  theory  and  assume  sample  sizes 
approaching  infinity,  but  may  also  hold  for  large  samples.  Each  of  the  subsequent 
limit  laws  uses  notation  which  refers  to  a  standardized  variate  X^;  the  standardization 
removes  the  population  characteristics,  and  leaves  a  parameter-free  variate,  similar 
to  a  z-score. 

Asymptotic  Theory  of  Extreme  Order  Statistics 

An  observation  belongs  to  an  extreme  order  if,  for  any  fixed  rank  k,  the  limit 
of  rank  divided  by  sample  size  as  sample  size  approaches  infinity,  is  either  0  or  1 
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(Leadbetter  et  al.,  1983).  When  data  are  ordered  in  descending  order,  the  hmit  is 

0  for  the  highest  rank,  and  for  the  lowest  rank  the  Umit  is  1.  Observations  at  these 

ranks  may  be  considered  as  extreme  order  statistics. 

For  an  independent  and  identically  distributed  population  with  a  known 

underlying  parent  distribution,  the  smaller  extremes  of  the  order  statistics  will  follow 

one  of  the  three  extremal  forms  for  minima  (David,  1981;  Galambos,  1978  &  1984; 

Leadbetter  et  al.,  1983) 

P<'*„K-^„)^x}-H(x) 

where  H  is  one  of  the  three  following  extremal  types 
for  minima: 
Type  I:  H(x)  =  l-exp(-exp(x))  -oo<x<oo 


Type  II:  H(x)  =  |  l-exp(-(-x)-")  a  >^0,jc<0 

Type  III:  H(x)  =  j?        ,     „,  l^^  ,. 

'^  I  l-expC-x")  a>0,  x>0 


(22) 


Similarly,  the  larger  extremes  of  the  order  statistics  will  follow  one  of  the  three 
extremal  forms  for  maxima  (Leadbetter  et  al,,  1983) 

P^„(M„-b„)<x}-G(x) 

where  G  is  one  of  the  three  following  extremal  types 

for  maxima: 

Type  I:  G(x)  =  exp(-exp(-x))         -oo<x<oo 

TypeII:G(x)  =  j^    ,      ,,  ,  ^^«, 

^^  ^  '      [exp(-x-«)  a>0,  x>0 

Type  III:  G(x)  =  jexp(-(-x)*')  a>0,  x<0 

^  '       [1  x>0 


(23) 
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The  normalizing  constants  «„,  )S„,  a„  and  b„,  are  derived  from  the  parent  distribution. 
The  forms  of  these  constants  have  been  derived  for  most  distributions  and  are 
detailed  in  Table  7  (Leadbetter  et  al.,  1983;  Takahashi,  1987). 

Independent  and  identically  distributed  populations  are  assumed  throughout. 
However  it  has  been  shown  that  this  may  be  relaxed  and  that  the  model  is  robust  at 
least  for  a  Gaussian  process  with  regards  to  dependence  and  stationarity  (Kiimison, 
1985).  Loynes  (1965)  reports  that  the  three  limit  laws  for  the  extremes  are  the  same 
for  the  independent  and  dependent  cases.  Berman  (1962)  has  also  shown  that  the 
assumption  of  an  identical  parent  distribution  may  be  relaxed  and  the  limits  will  still 
hold. 

Asymptotic  Theory  of  Central  and  Intermediate  Order  Statistics 

An  observation  belongs  to  a  central  or  intermediate  order  if,  for  any  fixed 
rank  k,  the  limit  of  rank  divided  by  sample  size,  as  sample  size  approaches  infinity, 
is  a  constant,  and  not  equal  to  either  0  or  1  (Leadbetter  et  al.,  1983). 

For  an  independent  and  identically  distributed  population  the  central  order 
statistics  will  follow  the  normal  distribution  (Hoeffding,  1948;  Mood  et  al.,  1963) 
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P{a„(Mr"^-b„)<x}-H(x) 

where  H  is  one  of  the  four  following  types 

for  the  k"*  central  rank: 

0  x<0 
♦  (ex")  x>0,c>0,a>0 

*(-c|x|«)  x<0,c>0,a>0 

1  x>0 


H(x)  = 
H(x)  = 


H(x)  = 
H(x)  = 


*(-cJxr) 

0 

1/2 
1 


x<0,Ci>0,a>0 
x>0,C2>0,a>0 


x<-l 
-l<x<l 
x>l 


(24) 


The  intermediate  order  statistics  will  follow  either  the  normal  or  lognormal 
distribution  (Chibisov,  1964;  Leadbetter  et  al.,  1983;  Wu,  1966) 


P{a„(Ml''"^-b„)<x}-H(x) 

where  H  is  one  of  the  three  following  types 

for  the  k"'  intermediate  rank: 

„,  .  _  f*(-alog|x|)  x<0,a>0 

^^^  ~  \  1  x>0, 

0  x<0 

x>0,a>0 

-<»<X<oo 


^  ^      [i  (alogx) 
H(x)  =  *(x) 


(25) 


where  *(x)  is  a  standard  normal  distribution  and  a  >  0.  The  constants  a„  and  b„  are 
distribution  dependent;  for  a  central  rank,  they  are  derived  from  the  distribution 
function  of  the  parent  (Bahadur,  1966;  Kendall  &  Stuart,  1977).  For  an  intermediate 
rank,  the  normalizing  constants  are  distributionally  dependent  also  (Watts,  1980). 
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These  results  are  robust  with  regards  to  stationarity  and  independence  for  a  Gaussian 

parent  (Watts,  1980;  Watts  et  al.  1982). 

Table  8.  Domains  of  Attraction  by  Type 

Distribution  Type  (Parent) 

Normal 

Exponential 

Lognormal 

Gamma 

Gumbel  (Type  I),  maxima 

Gumbel  (Type  I),  minima 

Frechet  (Type  II),  maxima 

Frechet  (Type  II),  minima 

Weibull  (Type  HI),  maxima 

Weibull  (Type  III),  minima 

GEV  Type  I,  k  =  0,  maxima 

GEV  Type  II,  k  <  0,  maxima 

GEV  Type  III,  k>0,  maxima 

Reference:  Leadbetter  et  al.,  1983 

The  parent  distribution  may  belong  to  one  of  three  domains  of  attraction  for 
extreme  order  statistics,  and  one  of  two  for  central  and  intermediate  orders.  The 
criteria  for  selecting  which  extreme  value  domain  of  attraction  a  particular  parent 
distribution  belongs  to  has  been  described  in  Leadbetter  et  al.  (1983)  and  is  listed 
in  Table  8.  Many  of  the  distributions  used  commonly  in  hydrology  to  model  daily 
discharge  (as  a  stochastic  residual  in  a  deterministic  model)  belong  to  the  type  I 
domain.  The  normal,  lognormal,  gamma  and  exponential  distributions  all  limit  in  the 
extreme  orders  to  a  type  I  distribution  (Leadbetter  et  al.,  1983;  Takahashi,  1987). 
Each  of  the  three  extreme  value  distributions  limits  to  its  own  distributional  form  in 


for  Maxima 

for  Minima 

Type  I 

Type  I 

Type  I 

Type  III 

Type  I 

Type  I 

Type  I 

Type  III 

Type  I 

Type  I 

Type  I 

Type  I 

Type  II 

Type  I 

Type  I 

Type  II 

Type  III 

Type  I 

Type  I 

Typem 

Type  I 

Type  I 

Type  II 

Type  I 

Type  III 

Type  I 
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Table  7.  Normalizing  Constants  for  Maxima 
Type  I  domain  gn  hn 


Normal 


(21og  nf'^      (21og  nfl^-\/2{2\og  n)-^/^ 
(log  log  n  +  log47r) 


Exponential 

1 

logn 

Log-normal 

a„e-^" 

e**",  where  a„  and  b„  from  normal 

Gamma 

1 

log  n  +  (r?-l)log  log  n  -  log  T{r)) 

Gumbel  (Type  I) 

1 

logn 

GEV,  k  =  0 

1 

logn 

Type  II  domain 

a„ 

b„ 

Frechet  (Type  II) 

n-i/a 

0 

GEV,  k<0 

n^- 

0 

Type  III  domain 

a„ 

b„ 

WeibuU  (Type  in) 

nV- 

0 

GEV,  k>0 

nV- 

0 

References:  Leadbetter  et  al.,  1983;  Takahashi,  1987 
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the  extreme  (Leadbetter  et  al.,  1983),  thus  the  generaUzed  extreme  value  distribution 

will  have  three  possible  limiting  forms  for  the  extremes. 

The  domain  of  attraction  for  central  ranks  is  the  normal  distribution.  The 
distribution  of  the  median,  and  other  central  orders  will  have  a  normal  or  Gaussian 
form  (Bahadur,  1966;  Galambos,  1984;  Mood  et  al.,  1963;  Sarhan  &  Greenberg, 
1962).  Chibisov  (1964)  has  detailed  the  distributional  requirements  for  each  of  the 
domains  of  attraction.  The  domains  of  attraction  for  intermediate  ranks  are  less 
clearly  defined.  Order  statistics  may  have  one  of  two  distributional  forms,  the  normal 
or  lognormal  (Balkema  et  al.,  1978a  &  1978b;  Chibisov,  1964;  Wu,  1966).  Despite 
the  relatively  slow  rates  of  convergence  to  the  asymptotic  forms  associated  with 
normal  or  lognormal  parents  (Leadbetter  et  al.,  1983;  David,  1981),  use  of  asymptotic 
theory  allows  the  distribution  for  any  rank  to  be  specified  on  the  basis  of  the  parent 
distribution  and  it's  parameters.  In  the  case  of  a  mixture  parent  distribution,  the 
order  statistic-based  model  is  robust  (Loynes  (1965);  Herman  (1962)).  Under  these 
conditions,  the  normalizing  constants  are  determined  by  the  sub-population  in  the 
direction  of  interest  (e.g.  the  smaller  sub-population  for  minima,  and  the  larger  sub- 
population  for  maxima). 

Figure  33  shows  the  distribution  of  the  annual  flood  (rank  1),  the  annual 
median  (rank  182),  and  the  annual  low-flow  (rank  365)  for  the  Sumas  River. 
Parameters  have  been  estimated  using  the  method  of  moments  estimators,  from  the 
corrected  means  and  variances  derived  from  application  of  order  statistics  to  the 
mixture  lognormal  parent  distribution.   The  normalizing  constants  are  not  derived 
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directly  from  the  parent  distribution,  as  the  severe  underestimation  of  the  variance 

of  all  ranks  is  carried  through  to  the  estimation  in  the  asymptotic  results.    These 

results  are  used  to  predict  the  shape  of  the  probability  distribution  for  the  particular 

ranks:  the  type  I  extreme  value  distribution  for  both  the  annual  flood  (maxima)  and 

low-flows  (minima),  and  the  normal  distribution  for  the  annual  median.  In  all  cases 

the  predicted  distributions  cannot  be  considered  significantly  different  from  the 

observed  using  the  K-S  test  with  a  significance  level  of  0.20  (Table  9). 

Concluding  Remarks 

Identification  of  the  parent  distribution  of  daily  discharge  allows  the 
subsequent  estimation  of  rank  means  and  variances  for  the  flow-duration  curve.  The 
inclusion  of  the  estimation  of  variance  by  rank,  adds  information  about  the  structure 
and  reliability  of  the  flow-duration  curve.  Both  sets  of  information  may  be  derived 
from  the  form  and  parameters  of  the  parent  distribution,  instead  of  the  calculation 
of  each  from  the  ranked  observations  of  daily  discharge.  Figure  34  presents  the 
mean  plus  and  minus  one  standard  deviation  by  rank.  Both  the  observed  and 
corrected  model  forms  are  included  in  this  graph.  There  is  little  observable 
difference  between  the  two  sets  of  curves. 

Asymptotic  theory  may  be  used  to  predict  the  shape  of  the  probability 
distribution  of  discharge  at  any  rank  based  on  identification  of  the  parent  distribution 
alone.  Using  the  first  two  moments  for  ranked  data  generated  from  the  large  sample 
approximations,  derived  from  the  parent  distribution,  in  turn,  permits  the  estimation 
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Table  9.  Sumas  River,  Annual  Distribution  Parameters 


Distribution 

Me^n 

Standard 
deviation 

a 

£ 

Annual 
Flood 

Observed       23.303 
Predicted       23.819 

a  =  s,/ 1.283 
/3  =  X  -  0.45s^ 

9.341 
7.628 

7.28 
5.945 

19.1 
20.387 

Type  I  distribution  for  maxima 

K-S  Statistic  for  predicted  parameters 


0.1581 


Annual 
Median 


Observed 
Predicted 


2.246 
2.011 


0.77 
0.76 


Normal  distribution 

K-S  Statistic  for  predicted  parameters 


0.0833 


Annual 
Low-flow 


Observed 
Predicted 


0.665 
0.279 


0.35 
0.35 


0.273    0.822 
0.273    0.437 


a  =  s,/1.283 
^  =  X  +  0.45s, 


Type  I  distribution  for  minima 

K-S  Statistic  for  predicted  parameters 


0.1212 


Reference:  Haan,  1977 
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Figure  34.  Sumas  River,  Flow-Duration  Curve  with  Standard  Deviations 
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of  the  parameters  of  the  rank  distribution.  These  two  results,  the  parameters  and  the 

form  of  the  distribution,  by  rank,  completely  summarize  the  flow-frequency  regime 

of  a  river. 

Identification  of  the  parent  distribution  is  the  critical  step  in  achieving  these 
results.  The  mixture  distribution  has  been  shown  to  be  a  flexible  parent  distribution 
in  the  highly  seasonal  regimes  of  British  Columbia.  Performance  of  the  mixture 
distribution  in  the  prediction  of  the  rank  mean  may  be  improved  through  the  use  of 
a  non-linear  regression  equation.  The  prediction  of  variance,  by  rank  is  more 
difficult;  the  effects  of  seasonaUty  and  basin  characteristics  manifest  themselves  in 
patterns  of  variability  in  both  the  time  and  frequency  domains.  Both  seasonaHty  and 
basin  area  have  the  effect  of  inflating  the  variances  at  middle  ranks.  Lake  storage 
tends  to  reduce  or  level  the  variance  at  the  lowest  ranks. 

Estimation  of  higher  order  moments,  such  as  the  variance,  are  susceptible  to 
outliers,  especially  with  restricted  sample  sizes.  In  general  the  higher  order  moments 
are  very  unreliable  (have  a  high  variance)  for  small  sample  sizes.  For  example,  the 
variance  of  the  sample  variance  is  [M4  -  (n-3)sV(n-l)]/n,  where  M4  is  the  fourth 
moment  about  the  mean,  s  is  the  standard  deviation  and  n  is  the  sample  size  (Haan, 
1977).  The  length  of  available  record  for  the  Sumas  River  is  32  years  and  for  the 
Quesnel  River  52,  which  provide  11,680  and  18,980  daily  observations,  respectively, 
for  estimation  of  the  parent  distribution.  However,  in  estimating  the  moments  at  a 
given  rank,  sample  sizes  are  reduced  to  32  and  52  again. 
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The  mixture  model  for  the  parent  distribution  is  adequate  for  the  description 

of  many  streamflow  regimes,  however,  the  effects  of  some  geophysical  factors  such 

as  seasonality,  basin  area  and  storage  act  as  bounds  on  the  distribution  of  streamflow. 

The  parent  distribution  should  reflect  these  upper  and  lower  bounds  and  the  serial 

correlations  which  may  be  present.  In  the  cases  where  the  mixture  distribution  has 

proved  to  be  inadequate,  further  investigation  is  warranted  to  identify  the  underlying 

frequency  structures  for  daily  discharge. 


CHAPTER? 
CONCLUSIONS 


Recently  there  has  been  considerable  concern  about  the  correct  marriage  of 
statistical  methods  and  physical  understanding  of  the  streamflow  generating  processes 
(Kleme§,  1986;  Potter,  1987).  This  dissertation  addresses  the  applied  problem  of  the 
representation  and  prediction  of  flow-duration  curves  in  a  way  that  provides  a  strong 
theoretical  basis  of  analysis  and  preserves  the  physical  underpinnings  of  the 
hydrologic  processes,  thereby  enhancing  the  soundness  of  the  results.  This  is 
achieved  through  the  correct  application  of  concepts  in  physical  geography,  hydrology 
and  statistics,  which  provide  the  foundation  for  the  research  approach.  More 
specifically,  this  encompasses  seven  major  components:  1)  the  foundation  of  the 
research  upon  the  physical  geography  and  climate  of  the  region,  2)  the  use  of  the 
basic  geomorphic  unit  of  the  drainage  basin,  3)  development  of  the  model  at  basin 
scale  and  leading  to  regional  scales,  4)  the  unification  of  a  mixed  mechanism  for  the 
generation  of  flows  and  its  statistical  analog,  5)  identification  of  statistically  and 
hydrologically  homogeneous  regions,  6)  prediction  of  the  parameters  of  the  model 
for  ungaged  basins,  and  7)  parsimony. 

The  final  model  which  is  numerically  based  in  large-sample  approximations 
of  order  statistics,  duplicates  the  otherwise  empirical  construction  technique  of  the 
calendar-year  flow-duration  curve.     Moreover  the  use  of  a  mixed  lognormal 
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distribution  as  the  parent  permits  the  representation  of  a  variety  of  flow  regimes,  but 

involves  no  loss  of  analytical  power,  as  the  necessary  solutions  are  well  documented. 

The  model  is  relatively  simple  to  use,  requiring  the  straight  forward  estimation  of  five 

parameters  to  summarize  the  three  hundred  and  sixty  five  calculated  means  of  the 

flow-duration  curve.  Ultimately  these  may  be  interpreted  physically  and  correspond 

closely  to  the  average  duration  and  magnitude  of  the  streamflow  generating  processes 

in  a  basin. 

The  spatial  distribution  of  the  parent  distribution's  parameters  permit  the 
geographic  interpretation  of  the  structure  of  the  stochastic  model.  Each  show 
variations  consistent  with  the  known  physiographic  and  hydrometeorologic  gradients 
in  the  province  of  British  Columbia.  The  resultant  maps  correspond  directly  to  the 
general  climate  and  physical  variables  which  control  streamflow  generation  in  the 
study  area.  Seven  reasonably  homogeneous  sub-regions  can  be  objectively  identified 
within  the  Province,  each  of  which  corresponds  to  a  hydrologically  distinct 
environment,  both  in  terms  of  available  hydrometeorological  data  and  the  spatially 
limited  existing  studies  of  other  streamflow  variables.  Identification  of  both 
reasonably  homogeneous  regions  and  the  gradients  of  the  parameters  across  those 
regions  facilitates  spatial  interpolation  and  the  prediction  of  flow-duration  curves  for 
rivers  where  streamflow  records  are  incomplete  or  unavailable. 

The  application  of  the  model  over  a  wide  variety  of  hydrologic  environments 
demonstrates  the  robustness  of  both  the  overall  technique  and  to  a  lesser  extent  the 
selection  of  the  mixed  lognormal  distribution  as  a  parent.    The  basic  modelling 
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procedure  may  be  used   in  any  hydrologic  environment.      However,   relative 

performance  of  the  parent  appears  to  vary  with  the  local  conditions.  In  particular, 

small,  low  storage,  rainfall-dominated  regimes  seem  to  be  particularly  suited  to  this 

analysis.     This  is  particularly  noticeable  when  the  technique  is  applied  to  the 

estimation  of  the  risk  associated  with  each  ranked  discharge  as  opposed  to  solely  the 

mean  at  that  rank.    The  problem  may  be  reduced  in  the  future  by  selection  of 

alternate  parent  distributions,  although  none  of  the  standard  distributions  used  to 

represent  streamflow  performed  as  well  as  the  mixed  lognormal  in  this  study.  The 

use  of  additional  statistical  sub-populations  may  be  the  most  readily  accomplished 

means  of  improving  the  model.  Both  the  number  of  sub-populations  and  the  form 

each  takes  may  be  varied.  This  may  provide  upper  and  lower  bounds  for  streamflow 

and  intermittent  processes.  However,  the  addition  of  sub-populations  requires  the 

a  priori  identification  and  subdivision  of  the  data  by  process.    Alternately,  a  less 

physically  based  distribution  such  as  the  Wakeby  may  be  employed,  but  the 

theoretical  development  of  this  distribution  is  incomplete.   The  limitations  of  the 

statistical  theory  in  including  highly  seasonal  and  autocorrelated  data  restricts  the 

future  development  of  the  model. 

The  use  of  maps  for  spatial  estimation  and  prediction  at  ungaged  locations  is 

presently  restricted  to  areas  with  nearby  gaged  basins  in  similar  environments.  The 

steep  hydrologic  gradients  in  many  mountainous  parts  of  the  study  area  limit  the 

accuracy  of  interpolation.    The  numerical  techniques  employed  to  represent  the 
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spatial  distributions  of  parameters  are  inadequate  to  handle  the  observed  gradients 

and  the  sparsity  of  data  in  some  areas  of  the  province. 

In  summary,  a  flexible  and  convenient  probabilistic  modelling  technique  has 

been  developed  for  representation  of  the  calendar-year  flow-duration  curve.    The 

model  provides  a  way  of  summarizing  the  flow-duration  curve  with  a  restricted 

number  of  parameters,  as  well  as  providing  a  theoretical  justification  for  its 

construction.  The  inclusion  of  variance  within  the  model  offers  great  potential  for 

representing  the  frequency  characteristics  of  a  river  in  more  analytical  depth.  With 

the  addition  of  distributional  forms  for  each  rank,  the  entire  flow-frequency  regime 

of  a  river  may  be  summarized  and  represented  by  a  few  parameters.  This  in  turn  can 

be  used  to  change  a  largely  descriptive  tool  of  hydrologists  into  an  analytical  tool  for 

the  analysis  of  risk  in  streamflow  studies.   The  model  presented  maintains  strong 

links  between  statistical  theory  and  both  physical  and  operational  hydrology.  This 

ensures  a  deeper  understanding  of  the  statistical  processes  which  control  the  common 

flood  measures,  and  in  turn,  this  knowledge  of  the  parent  distribution  may  be  used 

to  strengthen  these  same  flow-frequency  measures. 
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LISTING  OF  GAGING  STATIONS 


Map 
Key 

Station 
Identifier 

1 

07EA001 

2 

07EA002 

3 

07ED001 

4 

07EE007 

5 

07FA001 

6 

07FB001 

7 

07FC001 

8 

07FD001 

9 

08CB001 

10 

08CD001 

11 

08CE001 

12 

08CG001 

13 

08CG003 

14 

08CG004 

15 

08DA005 

16 

08DB001 

Station  Name 
Latitude         Longitude 


Basin  Area 


11,100  km^ 
2,720 


Finlay  River  at  Ware 

57-25- 15°N      125-37-35'W 

Kwadacha  River  near  Ware 

57-27-00         125-38-15 

Nation  River  near  Fort  St  James 

55-12-00         124-14-00  4,350 

Parsnip  River  above  Misinchinka  River 

55-04-41  122-54-17  4,900 

Halfway  River  near  Farrel  Creek  (Lower  Station) 

56-13-40         121-28-50  9,400 

Pine  River  at  East  Pine 

55-43-12         121-12-28  12,100 

Beatton  River  near  Fort  St  John 

56-16-12         120-40-43  15,600 

Kiskatinaw  River  near  Farmington 

55-57-25         120-22-50  3,570 

Stikine  River  above  Grand  Canyon 

58-02-38         129-56-45  18,800 

Tuya  River  near  Telegraph  Creek 

58-04-20         130-49-27  3,600 

Stikine  River  at  Telegraph  Creek 

57-54-03         131-09-16  29,300 

Iskut  River  below  Johnson  River 

56-44-20         131-40-25  9,350 

Iskut  River  at  Outlet  of  Kinaskan  Lake 

57-32-00         130-12-28  1,250 

Iskut  River  above  Snippaker  Creek 

56-41-55         130-52-23  7,230 

Surprise  Creek  near  the  Mouth 

56-06-35         129-28-33  221 

Nass  River  above  Shumal  Creek 

55-12-50         129-08-20  19,200 


Record 

20yeais 

22 

20 

20 

19 

23 

22 

24 

23 

21 

23 

23 

21 

19 

19 

34 


111 


112 


17  08DC006 

18  08DD001 

19  08EB003 

20  08EB004 

21  08EC001 

22  08ED002 

23  08EE004 

24  08EE008 

25  08EF001 

26  08EF005 

27  08EG011 

28  08EG012 

29  08FA002 

30  08FB002 

31  08FB007 

32  08FF001 

33  08FF002 

34  08FF003 

35  08GA024 

36  08GD004 

37  08HA001 

38  08HA003 


Bear  River  above  Bitter  Creek 

56-02-34         129-55-30  350 

Unuk  River  near  Stewart 

56-21-05         130-41-30  1,480 

Skeena  River  at  Glen  Vowell 

55-18-09         127-40-11  25,900 

Kispiox  River  near  Hazelton 

55-28-00         127-44-31  1,870 

Babine  River  at  Babine 

55-19-00         126-37-30  6,480 

Morice  River  near  Houston 

54-07-05         127-25-26  1,910 

Bulkley  River  at  Quick 

54-37-05         126-53-55  7,360 

Goathom  Creek  near  Telkwa 

54-38-50         127-07-20  132 

Skeena  River  at  Usk 

54-37-50         128-25-40  42,200 

Zymoetz  River  above  O.K.  Creek 

54-29-00         128-19-50  2,980 

Zymagotitz  River  near  Terrace 

54-31-07         128-43-40  376 

Exchamsiks  River  near  Terrace 

54-21-47         129-18-41  370 

Wannock  River  at  Outlet  of  Owikeno  Lake 

51-40-40         127-10-30  3,940 

Bella  Coola  River  near  Hagensborg 

52-26-32         126-19-56  4,040 

Bella  Coola  River  above  Burnt  Bridge  Creek 

52-25-20         126-09-29  3,730 

Kitimat  River  below  Hirsch  Creek 

54-03-34         128-36-00  1,990 

Hirsch  Creek  near  the  Mouth 

54-03-48         128-36-00  347 

Little  Wedeene  River  below  Bowbyes  Creek 

54-08-11  128-41-24  188 

Cheakamus  River  near  Mons 

50-05-00         123-02-54  287 

Homathko  River  at  the  Mouth 

50-59-05         124-55-01  5,720 

Chemainus  River  near  Westholme 

48-52-45         123-42-07  355 

Koksilah  River  at  Cowichan  Station 

48-43-39    123-40-11       209 


20 
21 
25 
21 
39 
26 
40 
24 
45 
23 
25 
23 
29 
18 
17 
21 
21 
19 
23 
19 
33 
30 
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39  08HA010 

40  08HA016 

41  08HB004 

42  08HB014 

43  08HB024 

44  08HB029 

45  08HD001 

46  08HE006 

47  08JA002 

48  08JB002 

49  08JE001 

50  08KA001 

51  08KA004 

52  08KA005 

53  08KA007 

54  08KA008 

55  08KA009 

56  08KB001 

57  08KB003 

58  08KC001 

59  08KD001 

60  08KD003 


San  Juan  River  near  Port  Renfrew 

48-34-38         124-17-49  580  23 

Bings  Creek  near  the  Mouth 

48-47-22         123-43-28  15.50  20 

Little  Qualicum  River  at  Outlet  of  Cameron  Lake 

49-17-27         124-35-00  135  34 

Sarita  River  near  Bamfield 

48-53-34         124-57-54  162  34 

Tsable  River  near  Fanny  Bay 

49-31-03         124-50-30  113  26 

Little  Qualicum  River  near  Qualicum  Beach 

49-21-07         124-29-07  237  24 

Campbell  River  at  Outlet  of  Campbell  Lake 

50-00-08         125-23-20  1,400  37 

Zeballos  River  near  Zeballos 

50-00-52         126-50-33  181  23 

Ootsa  River  at  Ootsa  Lake 

53-37-30         125-44-00  4,450  18 

Stellako  River  at  Glenaiman 

54-00-38         125-00-01  3,600  37 

Stuart  River  near  Fort  St  James 

54-25-05         124-16-30  14,600  45 

Dore  River  near  McBride 

53-18-38         120-14-45  404  21 

Fraser  River  at  Hansard 

54-04-43         121-50-52  18,000  34 

Fraser  River  at  McBride 

53-17-10         120-06-46  6,890  30 

Fraser  River  at  Red  Pass 

52-58-55         119-00-15  1,700  31 

Moose  River  near  Red  Pass 

52-55-12         118-48-00  458  29 

McKale  River  near  940m  Contour 

53-26-41  120-13-10  252  16 

Fraser  River  at  Shelley 

54-00-40         122-37-00  32,400  37 

McGregor  River  at  Lower  Canyon 

54-13-56         121-40-00  4,770  27 

Salmon  River  near  Prince  George 

54-05-46         122-40-41  4,300  28 

Bowron  River  near  Wells 

53-15-40         121-24-50  458  24 

Willow  River  near  Willow  River 

54-04-07         122-27-50  3,110  22 
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61  08KE009        Cottonwood  River  near  Cinema 

53-09-15         122-28-30  1,910  22 

62  08KF001        Nazko  River  above  Michelle  Creek 

52-55-45         123-32-50  3,240  20 

63  08KG001        West  Road  River  near  Cinema 

53-18-40         122-53-15  12,400  17 

64  08KH001        Quesnel  River  at  Likely 

52-36-54         121-34-15  5,930  52 

65  08KH003        Cariboo  River  below  Kangaroo  Creek 

52-40-00         121-40-07  3,260  48 

66  08KH006        Quesnel  River  near  Quesnel 

52-50-37         122-13-25  11,500  41 

67  08KH010        Horsefly  River  above  McKinley  Creek 

52-17-23         121-03-37  785  25 

68  08KH014        Mitchell  River  at  Outlet  of  Mitchell  Lake 

52-50-36         120-44-30  245  17 

69  08KH019        Moffat  Creek  near  Horsefly 

52-18-49         121-24-10  539  21 

70  08LA001        Clearwater  River  near  Clearwater  Station 

51-39-20         120-03-55  10,200  45 

71  08LA004        Murtle  River  above  Dawson  Falls 

51-58-46         120-06-13  1,380  28 

72  08LA007        Clearwater  River  at  Outlet  of  Clearwater  Lake 

52-08-20         120-11-30  2,950  29 

73  08LA008        Mahood  River  at  Outlet  of  Mahood  Lake 

51-55-55         120-14-29  4,710  22 

74  08LA013        Clearwater  River  at  Outlet  of  Hobson  Lake 

52-25-18         120-19-26  904  23 

75  08LB020        Barriere  River  at  the  Mouth 

51-10-48         120-07-50  1,140  36 

76  08LB022        North  Thompson  River  near  Barriere 

51-11-10         120-09-05  17,700  39 

77  08LB047        North  Thompson  River  at  Birch  Island 

51-36-10         119-54-55  4,450  27 

78  08LB064         North  Thompson  River  at  McLure 

51-02-29         120-14-28  19,600  29 

79  08LB069        Barriere  River  below  Sprague  Creek 

51-14-50         119-55-50  624  22 

80  08LD001        Adams  River  near  Squilax 

50-57-14         119-40-25  3,080  57 

81  08LE024        Eagle  River  near  Mulakwa 

50-56-15         118-47-59  904  22 

82  08LE027        Seymour  River  near  Seymour  Arm 

51-15-45         118-56-48  805  18 
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83 

08LE031 

South  Thompson  River  at  Chase 

50-49-37         119-41-57                  16,200 

58 

84 

08LF022 

Thompson  River  at  Spences  Bridge 

50-25-15         121-20-30                  54,600 

36 

85 

08LG048 

Coldwater  River  near  Brookmere 

49-51-20         120-54-27                  316 

20 

86 

08LG056 

Guichon  Creek  above  Tunkwa  Lake  Diversion 

50-36-30         120-54-39                  78.20 

19 

87 

08MA001 

Chilko  River  near  Redstone 

52-04-12         123-32-12                  6,940 

35 

88  08MA002 

89  08ME004 

90  08MF003 

91  08MF062 

92  08MG003 

93  08MG004 

94  08MG005 

95  08MG006 

96  08MG007 

97  08MG013 

98  08MH001 

99  08MH006 

100  08MH016 

101  08MH029 

102  08MH056 

103  08MH076 

104  08MH103 


Chilko  River  at  Outlet  of  Chilko  Lake 

51-37-31         124-08-31  2,110  26 

Bridge  River  at  Lajoie  Falls 

50-50-15         122-51-24  956  21 

Coquihalla  River  near  Hope 

49-22-31  121-25-10  741  32 

Coquihalla  River  below  Needle  Creek 

49-33-33  121-06-20  85.50  20 

Green  River  near  Pemberton 

50-16-55         122-51-05  855  34 

Green  River  near  Rainbow 

50-10-15         122-55-00  195  26 

Lillooet  River  near  Pemberton 

50-20-08         122-47-58  2,160  61 

Rutherford  Creek  near  Pemberton 

50-16-00         122-52-20  179  23 

Soo  River  near  Pemberton 

50-13-30         122-53-00  283  23 

Harrison  River  near  Harrison  Hot  Springs 

49-18-40         121-48-08  7,870  35 

Chilliwack  River  at  Vedder  Crossing 

49-05-50         121-57-45  1,230  51 

North  Alouette  River  at  232nd  Street,  Maple  Ridge 

49-14-34         122-34-42  37.30  28 

Chilliwack  River  at  Outlet  of  Chilliwack  Lake 

49-05-02         121-27-24  329  54 

Sumas  River  near  Huntingdon 

49-00-09         122-13-50  149  32 

Slesse  Creek  near  Vedder  Crossing 

49-04-16         121-41-58  162  24 

Kanaka  Creek  near  Webster  Comers 

49-12-25  122-32-05  47.70  25 

Chilliwack  River  above  Slesse  Creek 

49-06-09         121-39-42  645  24 
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105  08MH104       Anderson  Creek  at  the  Mouth 

49-05-34         122-40-58  27.20 

106  08MH105       Nicomekl  River  below  Murray  Creek 

49-06-02         122-38-44  64.50 

107  08NA002        Columbia  River  at  Nicholson 

51-14-38         116-54-42  6,660 

108  08NA024        Windermere  Creek  near  Windermere 

50-29-03         115-56-02  84.20 

109  08NA045        Columbia  River  near  Fairmont  Hot  Springs 

50-19-25         115-51-45  891 

110  08ND006        Columbia  River  at  Twelve  Mile  Ferry 

50-51-11         118-06-24  28,500 

111  08ND009       Downie  Creek  near  Revelstoke 

51-28-37         118-25-32  655 

112  08ND012        Coldstream  River  below  Old  Camp  Creek 

51-40-06         118-35-49  938 

113  08ND013        lUecillewaet  River  at  Greeley 

51-00-49         118-04-57  1,170 

114  08ND014       Jordan  River  above  Kirkup  Creek 

51-02-40         118-16-09  272 

115  08NE001        Incomappleux  River  near  Beaton 

50-46-25         117-40-36  1,020 

116  08NE006        Kuskanax  Creek  near  Nakusp 

50-15-03         117-48-44  350 

117  08NE008        Beaton  Creek  near  Beaton 

50-44-09         117-43-44  99.50 

118  08NE039        Big  Sheep  Creek  near  Rossland 

49-01-00         117-56-40  347 

119  08NE074        Salmo  River  near  Salmo 

49-04-07         117-16-37  1,230 

120  08NE077        Barnes  Creek  near  Needles 

49-54-27         118-07-31  201 

121  08NE087        Deer  Creek  at  Deer  Park 

49-25-30         118-02-30  80.50 

122  08NF002        Kootenay  River  at  Canal  Flats 

50-08-52         115-47-57  5,390 

123  08NG005        Kootenay  River  at  Wardner 

49-25-13         115-25-10  13,600 

124  08NG012        St  Mary  River  at  Wycliffe 

49-36-01         115-51-46  2,360 

125  08NG042       Kootenay  River  at  Newgate 

49-00-52         115-10-27  20,000 

126  08NG046        St  Mary  River  near  Marysville 

49-36-29    116-10-07        1,480 


21 
18 
68 
20 
42 
19 
23 
23 
24 
24 
35 
24 
33 
38 
38 
37 
27 
29 
54 
38 
41 
40 
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127 

08NG051 

Skookumchuck  Creek  near  Skookumchuck 

49-54-42         115-46-03 

637 

26 

128 

08NG053 

Kootenay  River  near  Skookumchuck 

49-54-38         115-44-08 

7,120 

37 

129 

08NG065 

Kootenay  River  at  Fort  Steele 

49-36-43         115-38-04 

11,400 

24 

130 

08NH001 

Duncan  River  near  Howser 

50-17-09         116-56-44 

2,160 

36 

131 

08NH005 

Kaslo  River  below  Kemp  Creek 

49-54-27         116-57-07 

453 

27 

132 

08NH006 

Moyie  River  at  Eastport 

48-59-58         116-10-43 

1,480 

58 

133 

08NH007 

Lardeau  River  at  Marblehead 

50-15-47         116-58-02 

1,620 

43 

134 

08NH032 

Boundary  Creek  near  Porthill 

48-59-50         116-34-05 

251 

57 

135 

08NH034 

Moyie  River  at  Moyie 

49-15-30         115-51-00 

725 

23 

136 

08NH066 

Lardeau  River  at  Gerrard 

50-30-36         117-16-29 

769 

20 

137 

08NH084 

Arrow  Creek  near  Erickson 

49-09-32         116-27-04 

78.70 

24 

138 

08NH115 

Sullivan  Creek  near  Canyon 

49-06-14         116-25-35 

6.22 

21 

139 

08NfH119 

Duncan  River  below  B.B.  Creek 

50-38-17         117-02-50 

1,330 

24 

140 

08NH120 

Moyie  River  above  Negro  Creek 

49-25-20         115-56-28 

240 

23 

141 

08NJ013 

Slocan  River  near  Crescent  Valley 

49-27-38         117-33-52 

3,320 

62 

142 

08NJ014 

Slocan  River  at  Slocan  City 

49-46-08         117-28-23 

1,660 

28 

143 

08NJ061 

Redfish  Creek  near  Harrop 

49-37-20         117-03-15 

26.20 

19 

144 

08NJ129 

Fell  Creek  near  Nelson 

49-30-12         117-15-37 

4.40 

21 

145 

08NJ130 

Anderson  Creek  near  Nelson 

49-30-05         117-15-35 

9.07 

21 

146 

08NK002 

Elk  River  at  Fernie 

49-30-36         115-04-17 

3,110 

19 

147 

08NK012 

Elk  River  at  Stanley  Park 

49-18-42         115-03-04 

3,520 

25 

148 

08NK016 

Elk  River  near  Natal 

49-51-58         114-52-06 

1,870 

32 
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149  08NL004 

150  08NL007 

151  08NL036 

152  08NL038 

153  08NM015 

154  08NM134 

155  08NM142 

156  08NN002 

157  08NN012 

158  08NN013 

159  08NN015 

160  08NN019 

161  08NP001 

162  08OA002 

163  08PA001 

164  09AA006 

165  09AA010 

166  09AA013 

167  09AA014 

168  09AA015 

169  09AE003 

170  lOAAOOl 


Ashnola  River  near  Keremeos 
49-12-33         119-59-23  1,050 

Similkameen  River  at  Princeton 
49-26-47         120-31-17  1,850 

Whipsaw  Creek  below  Lamont  Creek 
49-22-09         120-34-11  185 

Similkameen  River  near  Hedley 
49-22-39         120-09-06  5,590 

Vaseux  Creek  Above  Dutton  Creek 
49-15-44         119-28-27  255 

Camp  Creek  at  Mouth  near  Thirsk 
49-42-40         120-00-30  33.90 

Coldstream  Creek  above  Municiple  Intake 
50-15-28         119-04-55  58.50 

Granby  River  at  Grand  Forks 
49-02-39         118-26-19  2,050 

Kettle  River  near  Laurier 
48-59-04         118-12-55  9,840 

Kettle  River  near  Ferry 
48-58-53         118-45-55  5,750 

West  Kettle  River  near  McCulloch 
49-42-15         119-05-31  230 

Trapping  Creek  near  the  Mouth 
49-33-52         119-03-08  144 

Flathead  River  at  Flathead 
49-00-02         114-28-35  1,110 

Yakoun  River  near  Port  Clements 
53-36-50         132-12-35  477 

Skagit  River  near  Hope 
49-02-50         121-05-45  907 

Atlin  River  near  Atlin 
59-35-57         133-48-48  6,810 

Lindeman  Creek  near  Bennett 
59-50-16         135-00-50  240 

Tutshi  River  at  Outlet  of  Tutshi  Lake 
59-56-48         134-19-29  829 

Fantail  River  at  Outlet  of  Fantail  Lake 
59-35-40         134-23-26  717 

Wann  River  near  Atlin 
59-25-55         134-12-20  269 

Swift  River  near  Swift  River 
59-55-50         131-46-04  3,320 

Liard  River  at  Upper  Crossing 
60-03-00         128-54-00  33,400 


39 
45 
23 
22 
21 
21 
20 
24 
58 
58 
22 
22 
36 
21 
25 
37 
30 
23 
24 
23 
27 
27 
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171  10AC002 

172  10AC003 

173  10AC004 

174  10AC005 

175  lOADOOl 

176  lOBAOOl 

177  lOBBOOl 

178  lOBCOOl 

179  lOBEOOl 

180  10BE004 

181  lOCBOOl 

182  lOCDOOl 


Dease  River  at  McDame 

59-11-20         129-12-44  6,940 

Dease  River  at  Outlet  of  Dease  Lake 

58-48-50         130-05-00  1,520 

Blue  River  near  the  Mouth 

59-45-30         129-07-40  1,700 

Cottonwood  River  above  Bass  Creek 

59-07-08         129-49-23  888 

Hyland  River  near  Lower  Post 

59-57-03         128-09-03  9,450 

Turnagain  River  above  Sandpile  Creek 

59-03-52         127-50-41  6,580 

Kechika  River  at  the  Mouth 

59-36-55         127-18-37  22,700 

Coal  River  at  the  Mouth 

59-41-29         126-57-02  9,190 

Liard  River  at  Lower  Crossing 

59-24-45         126-05-50  104,000 

Toad  River  above  Nonda  Creek 

58-51-20         125-22-50  2,570 

Sikanni  Chief  River  near  Fort  Nelson 

57-14-03         122-41-39  2,160 

Muskwa  River  near  Fort  Nelson 

58-47-18    122-39-33       20,300 


25 
24 
23 
24 
27 
19 
21 
24 
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24 
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29 


APPENDIX  B 
PARAMETER  ESTIMATES  FOR  PARENT  DISTRIBUTION 


First  Component 

Second  Component 

Station 

Standard 

Mixing 

Standard 

Identifier 

Mean 

Deviation 

Proportion 

Mean 

Deviation 

07EA001 

-5.58509 

0.41123 

0.42323 

-3.99613 

0.92150 

07EA002 

-5.68087 

0.48399 

0.41529 

-3.91871 

0.96804 

07ED001 

-5.65136 

0.36112 

0.45942 

-4.35923 

1.00676 

07EE007 

-4.91979 

0.49264 

0.41802 

-3.49898 

0.92683 

07FA001 

-6.53336 

0.53982 

0.40742 

-4.85883 

1.02305 

07FB001 

-5.63346 

0.51350 

0.41756 

-4.15855 

0.97986 

07FC001 

-9.36006 

1.07985 

0.38356 

-6.33483 

1.68478 

07FD001 

-8.13064 

0.77699 

0.39198 

-6.28421 

1.46855 

08CB001 

-5.94762 

0.47330 

0.41257 

-4.08546 

0.97800 

08CD001 

-6.13571 

0.43789 

0.43130 

-4.65117 

1.01151 

08CE001 

-5.96719 

0.43665 

0.41416 

-4.20159 

0.93511 

08CG001 

-4.77221 

0.53113 

0.39715 

-2.95812 

0.91103 

08CG003 

-5.68122 

0.43336 

0.40981 

-4.20637 

0.84783 

08CG004 

-5.17725 

0.52423 

0.40393 

-3.18552 

0.99132 

08DA005 

-4.97400 

0.65636 

0.39360 

-2.72621 

1.10819 

08DB001 

-4.86673 

0.60697 

0.37284 

-3.12638 

0.88524 

08DC006 

-4.53329 

0.61535 

0.38910 

-2.61485 

0.97953 

08DD001 

-4.31790 

0.54930 

0.38650 

-2.58889 

0.87493 

08EB003 

-5.38216 

0.49163 

0.40152 

-3.70473 

0.89820 

08EB004 

-5.41440 

0.60076 

0.37856 

-3.68828 

0.91307 

08EC001 

-5.80126 

0.38924 

0.39938 

-4.83917 

0.68089 

08ED002 

-4.46167 

0.44267 

0.38806 

-3.10509 

0.71455 

08EE004 

-5.29001 

0.52735 

0.37915 

-3.92045 

0.78080 

08EE008 

-6.35795 

0.66572 

0.38438 

-4.36553 

1.05896 

08EF001 

-5.31103 

0.52821 

0.38542 

-3.76482 

0.84893 

08EF005 

-4.81553 

0.50942 

0.39387 

-3.26783 

0.84404 

08EG011 

-4.10510 

0.60946 

0.35388 

-2.72718 

0.81424 

08EG012 

-3.73947 

0.67498 

0.34118 

-2.12925 

0.86772 

08FA002 

-3.53723 

0.43760 

0.37152 

-2.34439 

0.63415 

08FB002 

-4.82202 

0.44870 

0.39987 

-3.36930 

0.75548 

08FB007 

-5.10709 

0.45071 

0.40458 

-3.57205 

0.80011 
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08FF001 

-3.86103 

0.55360 

0.34846 

-2.69281 

0.74878 

08FF002 

-4.04604 

0.61878 

0.34358 

-2.81114 

0.89499 

08FF003 

-3.70292 

0.56936 

0.37228 

-2.33867 

0.82257 

08GA024 

-4.08187 

0.49154 

0.39789 

-2.56636 

0.83614 

08GD004 

-4.42166 

0.44550 

0.41608 

-2.95310 

0.84205 

08HA001 

-5.35754 

1.02527 

0.32460 

-3.14041 

1.22975 

08HA003 

-5.66095 

0.85822 

0.36359 

-3.28725 

1.23589 

08HA010 

-4.60202 

0.85760 

0.35611 

-2.66084 

1.13639 

08HA016 

-6.01423 

0.69974 

0.40493 

-3.78488 

1.35044 

08HB004 

-4.08127 

0.65492 

0.33339 

-2.73397 

0.81546 

08HB014 

-4.16954 

0.84313 

0.36229 

-2.32506 

1.17105 

08HB024 

-4.36211 

0.82825 

0.32885 

-2.87489 

1.06945 

08HB029 

-4.32112 

0.65477 

0.33503 

-3.00729 

0.79231 

08HD001 

-3.68200 

0.42089 

0.36879 

-2.70090 

0.63448 

08HE006 

-2.94211 

0.41085 

0.40924 

-1.94998 

0.79367 

08JA002 

-4.85516 

0.43446 

0.39459 

-3.68353 

0.69547 

08JB002 

-6.18511 

0.38368 

0.42658 

-5.03112 

0.77933 

08JE001 

-5.54813 

0.34753 

0.40453 

-4.57653 

0.59451 

08KA001 

-5.21438 

0.49710 

0.42618 

-3.29654 

1.06806 

08KA004 

-4.99627 

0.44623 

0.39756 

-3.52395 

0.78600 

08KA005 

-5.09807 

0.44668 

0.42315 

-3.45656 

0.92233 

08KA007 

-5.37370 

0.45587 

0.43057 

-3.53455 

1.03617 

08KA008 

-5.53050 

0.52210 

0.42435 

-3.45630 

1.14780 

08KA009 

-5.05406 

0.47963 

0.42435 

-3.40196 

1.00247 

08KB001 

-4.97940 

0.45197 

0.38929 

-3.55555 

0.74880 

08KB003 

-4.57094 

0.56650 

0.37140 

-2.95227 

0.82305 

08KC001 

-6.44277 

0.39726 

0.45543 

-5.09822 

1.06547 

08KD001 

-4.90786 

0.42162 

0.40138 

-3.79167 

0.71963 

08KD003 

-5.46184 

0.41926 

0.42514 

-4.11379 

0.86001 

08KE009 

-5.95508 

0.49563 

0.43150 

-4.37480 

1.04695 

08KF001 

-7.98199 

0.48123 

0.43195 

-6.66719 

1.08041 

08KG001 

-6.90417 

0.29135 

0.45853 

-5.83841 

0.81598 

08KH001 

-4.91148 

0.43213 

0.39550 

-3.70580 

0.70656 

08KH003 

-4.89366 

0.48486 

0.39180 

-3.45064 

0.80176 

08KH006 

-4.97673 

0.43377 

0.39656 

-3.76107 

0.70752 

08KH010 

-5.12424 

0.46755 

0.42165 

-3.66106 

0.92567 

08KH014 

-4.38613 

0.45154 

0.40741 

-2.86433 

0.81318 

08KH019 

-6.68891 

0.53445 

0.42630 

-5.17091 

1.06252 

08LA001 

-5.21565 

0.45335 

0.41334 

-3.72074 

0.86100 

08LA004 

-4.68159 

0.42683 

0.42334 

-3.40383 

0.85524 

08LA007 

-4.58657 

0.44421 

0.41458 

-2.96746 

0.88774 

08LA008 

-6.30647 

0.52411 

0.41959 

-4.95803 

0.97857 

08LA013 

-4.56399 

0.47611 

0.41097 

-2.88061 

0.88965 

08LB020 

-5.83963 

0.39820 

0.45747 

-4.41786 

1.06636 
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08LB022 

-5.22593 

0.45409 

0.40955 

-3.70431 

0.85336 

08LB047 

-4.85776 

0.48122 

0.40788 

-3.30029 

0.86933 

08LB064 

-5.21080 

0.44736 

0.41182 

-3.71706 

0.84459 

08LB069 

-5.53073 

0.43423 

0.44729 

-4.06827 

1.06437 

08LD001 

-4.96463 

0.46026 

0.38870 

-3.69886 

0.82740 

08LE024 

-4.42519 

0.44132 

0.41756 

-3.10228 

0.84293 

08LE027 

-4.57703 

0.48870 

0.41577 

-3.07257 

0.92473 

08LE031 

-5.01212 

0.35086 

0.43234 

-3.90743 

0.73657 

08LF022 

-5.42944 

0.37907 

0.42308 

-4.16434 

0.76345 

08LG048 

-5.59752 

0.52768 

0.43538 

-3.97836 

1.13916 

08LG056 

-7.92842 

0.50582 

0.42725 

-6.54305 

1.06585 

08MA001 

-5.70712 

0.40918 

0.42356 

-4.23589 

0.82383 

08MA002 

-5.20123 

0.44704 

0.40861 

-3.80017 

0.78795 

08ME004 

-5.01903 

0.57229 

0.42065 

-3.03557 

1.09594 

08MF003 

-4.19628 

0.44109 

0.40328 

-3.03815 

0.76134 

08MF062 

-4.68238 

0.43458 

0.43645 

-3.29770 

0.96346 

08MG003 

-4.26200 

0.55056 

0.37506 

-2.78231 

0.80790 

08MG004 

-4.26954 

0.45214 

0.39437 

-3.05818 

0.74829 

08MG005 

-4.16867 

0.43610 

0.41166 

-2.71006 

0.81285 

08MG006 

-4.09339 

0.53682 

0.38210 

-2.69840 

0.87930 

08MG007 

-4.15332 

0.50660 

0.38486 

-2.57547 

0.79535 

08MG013 

-3.62873 

0.37022 

0.37538 

-2.74551 

0.56632 

08MH001 

-3.66750 

0.33105 

0.40330 

-2.78027 

0.59356 

08MH006 

-4.13349 

0.70281 

0.35596 

-2.72943 

1.02104 

08MH016 

-3.65564 

0.35975 

0.39726 

-2.73292 

0.61195 

08MH029 

-4.94953 

0.49013 

0.36823 

-3.80063 

0.81658 

08MH056 

-3.72620 

0.39334 

0.39430 

-2.70569 

0.67934 

08MH076 

-4.74663 

0.81000 

0.34806 

-2.98647 

1.05663 

08MH103 

-3.67787 

0.34366 

0.40467 

-2.77393 

0.61150 

08MH104 

-4.90602 

0.31498 

0.43944 

-3.65525 

0.86376 

08MH105 

-5.64498 

0.56482 

0.42200 

-3.73126 

1.19174 

08NA002 

-5.43513 

0.34578 

0.44807 

-3.99614 

0.88271 

08NA024 

-5.36261 

0.15069 

0.44645 

-4.81808 

0.41298 

08NA045 

-5.39040 

0.25876 

0.47091 

-4.32638 

0.81754 

08ND006 

-4.85880 

0.40919 

0.42946 

-3.30834 

0.89114 

08ND009 

-4.59855 

0.48426 

0.41346 

-2.98607 

0.90519 

08ND012 

-4.73015 

0.44367 

0.42372 

-3.08896 

0.93559 

08ND013 

-4.65599 

0.44376 

0.42643 

-2.99159 

0.93880 

08ND014 

-4.45488 

0.52284 

0.41770 

-2.73283 

1.00998 

08NE001 

-4.37281 

0.45007 

0.41867 

-2.79547 

0.88403 

08NE006 

-4.61222 

0.40052 

0.45142 

-3.21779 

1.02209 

08NE008 

-4.49166 

0.31020 

0.45119 

-3.46399 

0.76848 

08NE039 

-6.14432 

0.48063 

0.45447 

-4.28071 

1.28746 

08NE074 

-5.12562 

0.40330 

0.45432 

-3.68613 

1.04792 
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08NE077 

08NE087 

08NF002 

08NG005 

08NG012 

08NG042 

08NG046 

08NG051 

08NG053 

08NG065 

08NH001 

08NH005 

08NH006 

08NH007 

08NH032 

08NH034 

08NH066 

08NH084 

08NH115 

08NH119 

08NH120 

08NJ013 

08NJ014 

08NJ061 

08NJ129 

08NJ130 

08NK002 

08NK012 

08NK016 

08NL004 

08NLX)07 

08NL036 

08NL038 

08NM015 

08NM134 

08NM142 

08NN002 

08NN012 

08NN013 

08NN015 

08NN019 

08NP001 

08OA002 

08PA001 


-5.42956 
-6.13008 
-5.46092 
-5.48371 
-5.33720 
-5.42180 
-5.13787 
-5.47161 
-5.48369 
-5.48097 
-4.66513 
-4.89401 
-6.18005 
-4.54411 
-5.53186 
-6.24815 
-4.64921 
-5.15458 
-5.67128 
-4.76848 
-6.02858 
-4.80630 
-4.51356 
-5.06220 
-5.78107 
-5.98584 
-5.35623 
-5.25357 
-5.61958 
-6.51213 
-6.00152 
-6.86764 
-6.36384 
-7.11464 
-6.71062 
-7.19594 
-6.37838 
-6.64464 
-6.90534 
-6.43532 
-6.67859 
-5.26644 
-4.08020 
-4.71184 


0.38317 
0.39532 
0.36587 
0.36794 
0.40177 
0.37077 
0.41486 
0.35630 
0.35539 
0.36697 
0.44004 
0.41651 
0.48725 
0.37612 
0.46032 
0.50956 
0.37711 
0.34964 
0.29620 
0.47400 
0.55805 
0.37121 
0.33121 
0.43604 
0.45322 
0.42260 
0.35903 
0.39875 
0.38495 
0.38050 
0.43535 
0.40961 
0.40947 
0.41804 
0.34026 
0.55205 
0.59478 
0.50019 
0.52583 
0.58018 
0.55337 
0.38800 
0.65565 
0.46366 


0.46681 
0.45972 
0.44180 
0.44155 
0.45732 
0.44520 
0.44996 
0.46283 
0.44456 
0.44479 
0.42168 
0.44115 
0.45351 
0.44048 
0.45221 
0.45886 
0.43771 
0.46912 
0.46663 
0.42563 
0.45341 
0.44695 
0.45421 
0.46044 
0.44560 
0.44730 
0.44060 
0.44391 
0.44423 
0.46954 
0.45664 
0.46465 
0.46265 
0.47833 
0.46937 
0.43217 
0.43711 
0.45060 
0.45220 
0.44497 
0.44735 
0.46554 
0.35182 
0.41480 


-4.00550 
-4.62905 
-4.05470 
-4.14310 
-3.88607 
-4.15938 
-3.63538 
-4.13439 
-4.07621 
-4.13295 
-3.05812 
-3.49136 
-4.48016 
-3.21897 
-3.95016 
-4.59561 
-3.29045 
-3.85044 
-4.59263 
-2.99522 
-4.21784 
-3.56715 
-3.34106 
-3.58726 
-4.29943 
-4.50029 
-4.15658 
-3.99484 
-4.23377 
-5.04763 
-4.49389 
-5.41544 
-4.87904 
-5.58626 
-5.54286 
-5.63897 
-4.36968 
-4.91874 
-5.09363 
-4.52481 
-4.83673 
-3.84973 
-2.75708 
-3.42993 


1.12786 
1.10819 
0.91333 
0.90498 
1.08330 
0.89697 
1.06435 
1.03189 
0.92545 
0.92721 
0.89981 
0.96612 
1.23073 
0.87486 
1.17688 
1.29139 
0.86685 
1.03467 
0.87294 
0.99936 
1.38899 
0.88365 
0.84887 
1.17081 
1.08574 
1.07302 
0.86289 
0.92881 
0.96451 
1.18520 
1.16849 
1.18582 
1.14967 
1.36532 
1.04179 
1.15504 
1.29864 
1.22297 
1.30670 
1.37587 
1.32783 
1.11824 
0.96847 
0.86150 


124 


09AA006 
09AA010 
09AA013 
09AA014 
09AA015 
09AE003 
lOAAOOl 
10AC002 
10AC003 
10AC004 
10AC005 
lOADOOl 
lOBAOOl 
lOBBOOl 
lOBCOOl 
lOBEOOl 
10BE004 
lOCBOOl 
lOCDOOl 


-5.15099 
-5.47468 
-5.30477 
-5.88749 
-5.45170 
-5.60970 
-5.88531 
-5.72054 
-5.87253 
-5.90956 
-5.58910 
-5.92080 
-5.96969 
-5.91213 
-5.86377 
-5.93025 
-5.57883 
-5.99590 
-6.72220 


0.32941 
0.60863 
0.39918 
0.58779 
0.43296 
0.40773 
0.41008 
0.41869 
0.40976 
0.44152 
0.46913 
0.45225 
0.44959 
0.38029 
0.34295 
0.39629 
0.39315 
0.45783 
0.62333 


0.40460 
0.41426 
0.41755 
0.42690 
0.44550 
0.41363 
0.41190 
0.41518 
0.42233 
0.40453 
0.41680 
0.41547 
0.41162 
0.42049 
0.43370 
0.41378 
0.43232 
0.41139 
0.39587 


-4.12317 
-3.19760 
-3.81113 
-3.53635 
-3.61129 
-4.16720 
-4.38479 
-4.14357 
-4.54267 
-4.47856 
-3.85142 
-4.20397 
-4.24210 
-4.40147 
-4.46281 
-4.40279 
-3.98207 
-4.45186 
-4.59470 


0.57035 
1.19486 
0.82221 
1.25753 
1.07966 
0.83413 
0.84038 
0.89403 
0.86112 
0.87152 
0.99834 
0.96333 
0.92943 
0.83719 
0.87073 
0.83415 
0.92758 
0.99672 
1.10659 


APPENDIX  C 
LISTING  OF  TEST  GAGING  STATIONS 


Station 

Station  Name 

Identifier 

Latitude         Longitude 

Basin  Area 

Record 

08GA054 

Mamquam  River  above  Mashiter  Creek 

49-43-44°N      123-06-20'W 

334  km^ 

18  years 

08FB006 

Atnarko  River  near  the  Mouth 

52-21-39         126-00-10 

2,430 

18 

08KG001 

West  Road  River  near  Cinema 

53-18-40         122-53-15 

12,400 

18 

08NK019 

Grave  Creek  at  the  Mouth 

49-50-36         114-51-36 

619 

18 
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